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WD- 40 - DERIVED PEPTIDES AND USES THEREOF 

Field of the Invention 

The present invention relates in general to 
compositions and methods of modulating the function of proteins 
5 involved in protein-protein interactions. It relates more 

specifically to modulating the function of a first protein of a 
pair of interacting proteins wherein a second protein of the 
pair contains a "WD-40" or "S-transducin" amino acid repeat 
motif . 

10 Background Art 

Many intracellular processes are carried out or 
regulated by multi-subunit protein complexes that become active 
or repressed by the association or dissociation of individual 
polypeptide subunits. 

15 One such group or family of proteins is related to the 

S subunit of transducin. Members of this group are all at least 
somewhat homologous to the B- subunit of transducin at the amino 
acid level, and contain a varying number of repeats of a 
particular motif identified in S-transducin. The repeats have 

20 been termed " £- transducin " , or" "WD-40" repeats (Fong, et al.). 

Among the members of this protein family (Duronio, et 
al.) are the Gj3 subunits that couple many receptors to their 
intracellular effector molecules, G/J/7 subunits that anchor 
another protein kinase (the /3-adrenergic receptor kinase, j3ARK) , 

25 DNA binding proteins and yeast cell cycle proteins. All of 

these require a transient protein-protein interaction for their 
function. However, the sequences at the interface of these 
proteins and their partners have not been identified. 

The following are the references cited above and 



Crea, R., U.S. Patent No. 4,888,286, issued December 

19, 1989. 



WO 95/21252 PCT/US95/01210 

- 2 - 

Yoshio, T., et al . , U.S. Patent No. 4,849,350, issued 
July 18, 1989. 



Ausubel, F. M. , et al . , Current Protocols in Molecular 
5 Biology, John Wiley and Sons, Inc., Media PA. 

Bohinski , R . C . , Modern Concepts in Biochemistry , 
Second Edition, Allyn and Bacon, Inc. 

Dayhoff, M.O., in Atlas of Protein Sequence and 
Structure (1972) Vol. 5, National Biomedical Research 
10 Foundation, pp. 101-110, and Supplement 2 to this volume, pp. 1- 
10. 

Duronio, R.J., et al., (1992) Proteins: Structure, 
Function, and Genetics 13:41-56. 

Escobedo, J. A., et al . , Mol . Cell. Biol., 11:1125-1132 

15 (1991) . 

Fong, et al . , (1986) Proc Natl Acad Sci USA 83:2162- 

2166 . 

Hari, et al., Endocrinology, 120:829-831 (1987). 

Kleuss, C., et al . , Science 259:832-834 (1993). 
20 Makowske, O.M. and Rosen, O.M. J. Biol. Chem. 

264 :16155-16159 (1989) 

Maniatis, T. , et al., Molecular Cloning: A Laboratory 
Manual , Cold Spring Harbor Laboratory (1982) . 

Miller, J.F., et al., Nature (London) 216:659-63 

25 (1969) . 

Mochly-Rosen, D., and Koshland, D. E., Jr. J. Biol. 
Chem. 262:2291-2297 (1987). 

Mochly-Rosen, et al . , Molec. Biol. Cell. 1:693-706 

(1990) . 

30 Mochly-Rosen, D., et al., Proc. Natl. Acad. Sci. USA 



35 



Pitcher, J., et al . , Science 257:1264-1267 (1992) 
Reiner, et al . , Nature 364:717-721 (1993). 
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Smith, B.L. and Mochly-Rosen, D. Biochem. Biophys . 
Res. Coimnun. 188:1235-1240 (1992). 

Smith, D.B., et al . , Gene 67:31 (1988). 
Stith, B.J. and J.L. Mailer. Exp. Cell. Res. 169:514- 
5 523 (1987) . 

Wolf, M. and N. Sahyoun, Chem. , 261:13327-13332 

(1986) . 

Disclosure of the Invention 

The invention includes, in one aspect, a polypeptide 

10 composition effective to alter the activity of a first protein, 
such as protein kinase C, or /3-adrenergic receptor kinase 
(0ARK) . The polypeptide blocks or inhibits an interaction, such 
as a binding interaction, between the first protein and a second 
protein containing a WD-40 region. 

15 The polypeptide contains between 4 and 50 amino acids 

whose sequence is the same as a sequence of the same length in 
the WD-40 region of the second protein. 

The polypeptide may block the binding of the first to 
the second protein, or may be an agonist or antagonist of the 

20 first protein. The WD-40 region preferably has an amino acid 

sequence homologous or identical to the sequences defined by SEQ 
ID NO:76-261. 

In a second embodiment, the invention includes a 
method of altering the activity of the first protein of the type 

25 defined above. The method includes selecting a polypeptide 

having between 4 and 50 amino acids whose sequence is the same 
as a sequence of the same length in the WD-4 0 region of the 
second protein, and contacting the polypeptide with the first 
protein under conditions which allow the formation of a complex 

30 between the polypeptide and the first protein, where this 



35 



xii another embodiment, the contacting 
the activity of the first protein. 



is effective to stimulate 
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The polypeptide preferably has an amino acid sequence 
homologous or identical to the sequences defined by SEQ ID 
NO:76-261. 

In a more specific aspect of the invention, the 
5 invention includes a polypeptide composition effective to alter 
the activity of protein kinase C, where the protein kinase C 
interacts with a second protein, and the second protein contains 
at least one WD-40 region. The polypeptide has between 4 and 50 
amino acids whose sequence is the same as a sequence of the same 
10 length in the WD-40 region of the second protein. 

In a preferred embodiment, the second protein is a 
receptor for activated protein kinase C, and has the sequence 
represented by SEQ ID NO: 27. 

In other specific embodiments, the polypeptide is (i) 
15 an agonist of protein kinase C, and the polypeptide has the 

sequence represented by SEQ ID NO: 7; (ii) an antagonist of the 
activity of protein kinase C; and/or (iii) an inhibitor of the 
interaction between protein kinase C and the second protein. In 
the latter embodiment, the polypeptide has sequence 
20 corresponding to SEQ ID NO: 4 or SEQ ID NO: 7. 

The WD-4 0 region preferably has an amino acid sequence 
homologous or identical to SEQ ID NO: 69-75. 

In a related embodiment, the invention includes a 
method of altering the activity of a protein kinase C that 
25 interacts with a second protein, where said second protein 
contains at least one WD-40 region. 

The method includes selecting a polypeptide having 
between 4 and 50 amino acids whose sequence is the same as a 
sequence of the same length in the WD-40 region of the second 
30 protein, and contacting the polypeptide with the protein kinase 
C under conditions which allow the formation of a complex 

jo compositions of the invention wherein said polypeptide is 
coupled to a solid support, as well as a method to bind 
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polypeptide composition bound to solid support and removing any- 
unbound components of the sample from said composition. 

In still another aspect, the invention relates to a 
method to assess the interaction of a first protein with a 
5 polypeptide represented by an amino acid sequence contained in a 
second protein, wherein said second protein contains at least 
one WD-40 region, which method comprises contacting a sample 
containing said first protein with a polypeptide composition 
wherein the polypeptide has between 4 and 50 amino acids whose 

10 sequence is the same as the sequence of the same length in the 
WD-4 0 region of the second protein, and observing any 
interaction of the first protein with said polypeptide 
composition. The invention also concerns a method to assess the 
ability of a candidate compound to bind a first protein which 

15 method comprises contacting said first protein with a 

polypeptide composition which binds said first protein, wherein 
the polypeptide of said composition has between 4 and 50 amino 
acids whose sequence is the same as a sequence of the same 
length in a WD-40 region of a second protein which interacts 

2 0 with said first protein, in the presence and absence of said 
candidate compound; and measuring the binding of said 
polypeptide in the presence and in the absence of said 
candidate, wherein decreased binding of the polypeptide in the 
presence as opposed to the absence of said candidate indicates 

25 that said candidate binds to said first protein. 

In still another aspect, the invention is directed to 
recombinant materials for the production of the polypeptides of 
the invention and methods for their production. 

These and other objects and features of the invention 

30 will become more fully apparent when the following detailed 
description of the invention is read in conjunction with the 



Figure 1A shows the cDNA sequence of rat brain RACK] 
35 Figure IB shows an amino acid self -homology matrix 
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Figure 1C shows the amino acid sequence of RACK1, 
aligned to show the seven WD-4 0 repeats represented in the 
molecule. 

Figure 2 shows the results of an overlay assay to 
5 detect PKC binding to immobilized RACK1 in the presence and 
absence of PKC activators. 

Figure 3 shows the results of an overlay assay to 
detect PKC binding to immobilized RACK1 in the presence and 
absence of WD-40-derived peptides. 
10 Figure 4 shows the results of an overlay assay to 

detect binding of 0PKC to either peptide I (SEQ ID NO:l) or 
peptide rVI (SEQ ID NO: 7) immobilized on nitrocellulose 
membranes under various conditions. 

Figure 5A shows the effects of injecting peptides I 
15 (SEQ ID NO:l) and rVI (SEQ ID NO: 7) on PKC-mediated germinal 
vesicle breakdown (GVBD), a measure of * insulin- induced oocyte 
maturation . 

Figure 5B shows the effects of injecting peptides I 
(SEQ ID NO:l) and rVI (SEQ ID NO: 7) on PKC-mediated germinal 
20 vesicle breakdown (GVBD) in the absence of insulin induction. 

Figure 5C shows the effects of injecting peptide rill 
(SEQ ID NO:4) on PKC-mediated germinal vesicle breakdown (GVBD) 
in the absence of insulin induction. 

Figure 6 shows the distribution of jSPKC in Xenopus 
25 oocytes between the cytosolic and membrane-associated fractions 
following microinjection of either injection solution, peptide I 
(SEQ ID NO:l) or peptide rVI (SEQ ID NO:7) with or without 
insulin stimulation. 

Figure 7 shows the effects of peptides I and rVI on 
30 the sensitivity of 0PKC to Arg-C endopeptidase . 

Figure 8 shows the effects of peptides I and rVI on 



■i> d activators . 

Figure 10 shows the effects of peptide rill on PKC 
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Figure 11 shows the amino acid sequence of the 56 kDa 
human protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 12 shows the amino acid sequence of the AAC- 
5 rich protein with the WD-4 0 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 13 shows the amino acid sequence of the B-TRCP 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 
10 Figure 14 shows the amino acid sequence of the Beta- 

prime-COP protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 15 shows the amino acid sequence of the CDC4 
protein with the WD-40 repeats aligned and putative binding 
15 peptide regions delineated by a box. 

Figure 16 shows the amino acid sequence of the Chlam-3 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 17 shows the amino acid sequence of the COP-1 
20 protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 18 shows the amino acid sequence of the CORO 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 
25 Figure 19 shows the amino acid sequence of the Coronin 

p55 protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 20 shows the amino acid sequence of the Cstf 50 
kDa protein with the WD-40 repeats aligned and putative binding 
3 0 peptide regions delineated by a box. 

Figure 21 shows the amino acid sequence of the bovine 



ib G-beta-2 protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 
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Figure 2 3 shows the amino acid sequence of the 
drosophila G-beta protein with the WD-4 0 repeats aligned and 
putative binding peptide regions delineated by a box. 

Figure 24 shows the amino acid sequence of the human 
5 G-beta-1 protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 25 shows the amino acid sequence of the human 
G-beta-2 protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 
10 Figure 2 6 shows the amino acid sequence of the mouse 

G-beta protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 2 7 shows the amino acid sequence of the 
drosophila groucho protein with the WD-4 0 repeats aligned and 
15 putative binding peptide regions delineated by a box. 

Figure 2 8 shows the amino acid sequence of the squid 
GTP-binding protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 2 9 shows the amino acid sequence of the HSIEF 
20 930 protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 30 shows the amino acid sequence of the human 
12.3 protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 
25 Figure 31 shows the amino acid sequence of the human 

IEF-7442 protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 3 2 shows the amino acid sequence of the 
insulin- like growth factor binding protein complex with the WD- 
3 0 4 0 repeats aligned and putative binding peptide regions 
delineated by a box. 



joy a box. 

Figure 34 shows the amino acid sequence of the human 
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Figure 35 shows the amino acid sequence of the MD6 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 36 shows the amino acid sequence of the yeast 
MSI1 protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 37 shows the amino acid sequence of the mouse 
pc326 MUS protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 3 8 shows the amino acid sequence of the ORD RBI 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 39 shows the amino acid sequence of the 
periodic trp protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 40 shows the amino acid sequence of the PLAP 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 41 shows the amino acid sequence of the 
retinoblastoma binding protein with the WD-40 repeats aligned 
and putative binding peptide regions delineated by a box. 

Figure 42 shows the amino acid sequence of the S253 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 43 shows the amino acid sequence of the SOF1 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 44 shows the amino acid sequence of the STE4 
yeast protein with the WD-40 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 45 shows the amino acid sequence of the TF1 



protein with tne WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 
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Figure 4 7 shows the amino acid sequence of the TUP1 
homolog protein with the WD-4 0 repeats aligned and putative 
binding peptide regions delineated by a box. 

Figure 4 8 shows the amino acid sequence of the YCU7 
5 protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 49 shows the amino acid sequence of the YCW2 
protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 
10 Figure 50 shows the amino acid sequence of the YKL25 

protein with the WD-40 repeats aligned and putative binding 
peptide regions delineated by a box. 

Figure 51 shows the amino acid sequence of the YRB14 0 
protein with the WD-40 repeats aligned and putative binding 
15 peptide regions delineated by a box. 

Detailed Description of the Invention 

1 • Definitions 

Unless otherwise indicated, all terms used herein have 

the same meaning as they would to one skilled in the art of the 
20 present invention. Practitioners are particularly directed to 

Current Protocols in Molecular Biology (Ausubel) for definitions 

and terms of the art . 

Abbreviations for amino acid residues are the standard 

3-letter and/or 1-letter codes used in the art to refer to one 
25 of the 20 common L-amino acids. Likewise, abbreviations for 

nucleic acids are the standard codes used in the art. 

An "amino acid group" refers to a group of amino acids 

where the group is based on common properties, such as 

hydrophobicity , charge, or size. 
3 0 A "conserved set" of amino acids refers to a 



conserved set is between two and ten contiguous residues in 
35 length. The individual positions within a conserved set each 
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at a particular position, the conserved set sequence will 
contain only that residue at that position. For example, for 
the two peptides WRTAA (SEQ ID NO: 263) and WRTAV (SEQ ID 
NO:264), there are 4 identical positions (V7RTA; SEQ ID NO:265) 
and one position where the residue is an "A" or a "V" . 

Proteins are typically long chains of amino acid based 
polyamides (polypeptides) capable of creating secondary and 
tertiary structure. Proteins may be composed of one, two or 
more polypeptide chains and may further contain same other type 
of substance in association with the polypeptide chain (s) , such 
as metal ions or carbohydrates. The size of proteins covers a 
rather wide range from -5,000 to several hundred thousand 
g/mole. The 5,000 figure corresponds to the presence or roughly 
40-45 amino acids. 

Unless otherwise indicated, the sequence for proteins 
and peptides is given in the order from the amino terminus to 
the carboxyl terminus. Similarly, the sequence for nucleic 
acids is given in the order from the 5' end to the 3' end. 

The term "interacting proteins" refers to a pair of 
polypeptides that can form a stably-associated complex due to, 
for example, electrostatic, hydrophobic, ionic and/or hydrogen- 
bond interactions under physiological conditions. 

Proteins smaller than about 5,000 g/mole are typically 
referred to as polypeptides or simply peptides (Bohinski) . 

Two amino acid sequences or two nucleotide sequences 
are considered homologous (as this term is preferably used in 
this specification) if they have an alignment score of >5 (in 
standard deviation units) using the program ALIGN with the 
mutation gap matrix and a gap penalty of 6 or greater (Dayhof f ) . 
The two sequences (or parts thereof) are more preferably 
homologous if their amino acids are greater than or equal to 



parent: peptide or polypeptide if it has an amino acid sequence 
that is identical or homologous to the amino acid sequence of 
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ID N0:4) and peptide rVI (SEQ ID NO:7), which are derived from 
the third and seventh WD-40 repeats of RACK1 (SEQ ID NO:27), 
respectively. 

The term "expression vector" refers to vectors that 
5 have the ability to incorporate and express heterologous DNA 
fragments in a foreign cell. Many prokaryotic and eukaryotic 
expression vectors are commercially available. Selection of 
appropriate expression vectors is within the knowledge of those 
having skill in the art. 
10 The term "PKC" refers to protein kinase C, or C- 

kinase . 

The term "RACK" refers to receptor for activated C- 

kinase. 

The term "PS" refers to phosphatidylserine . 
15 The term "DG" refers to diacylglycerol . 

The term "PL" refers to phospholipids. Phospholipids 
include both phosphatidylserine and diacylglycerol . 

The term "GVBD" refers to germinal vesicle breakdown, 
a measure of insulin- induced maturation in Xenopus oocytes. 

2 0 The term "PCR" refers to polymerase chain reaction. 

The term "NMR" refers to nuclear magnetic resonance. 
The term " (3 ARK 1 ' refers to ^-adrenergic receptor 

kinase . 

II . General Overview of Invention. 
25 The invention relates to interacting proteins, at 

least one of which contains an amino acid sequence with one or 
more of the characteristic repeats termed WD-40 (Fong, et al . ) . 

According to one aspect of the invention, the function 
of a first protein of a pair of interacting proteins may be 

3 0 modulated, altered or disrupted by the addition, to a solution 



pair of interacting proteins. 
35 The modulation or disruption of function of the first 
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protein. The consequences of the binding or association of the 
binding peptide with the first protein depend on the sequence of 
the peptide. 

Typically, the presence of the binding peptide will 
5 inhibit the binding of the first protein to the second protein. 
This binding may be assayed in vitro by, for example, an overlay 
assay, whereby the degree of binding of one protein to another 
may be assessed. Several adaptations of overlay assays applied 
to embodiments of the present invention arp described herein. 

10 Regardless of whether or not the WD-4 0 -derived peptide 

affects the association of the first protein with the second 
protein, the peptide may alter or modulate defined activities of 
the first protein. These activities may be assayed by a variety 
of methods in vivo and/or in vitro. The method (s) employed 

15 depend on the protein whose activity is being measured. 

An exemplary first protein of a pair of interacting 
proteins is protein kinase C (PKC) . Upon activation, PKC 
interacts with receptors for activated C kinase (RACKs) , at 
least one of which (RACK1) contains WD-40 repeats. Several 

2 0 assays for determining the activity of PKC in the presence and 
in the absence of peptides derived from the WD-40 region of 
RACK1 are detailed herein. 

Certain "interacting proteins" interact only after one 
or more of them has been stimulated by an exogenous or 

25 endogenous factor(s). For instance, PKC, as shown herein, does 
not bind to RACK proteins until it has been activated by, for 
example, phosphatydilserine (PS) , diacylglycerol (DG) and 
calcium. However, peptides derived from WD-4 0 repeats of a 
second protein of such a pair may be able to associate with or 

30 bind to the first protein even in the absence of activators of 
the first protein, and in so doing, affect the function of the 

r ^ v- _ * , .. .. . * ..... ; , . ' . . 



3:j secona protein of a pair of interacting proteins, may be useful 
as specific agonists, antagonists, potentiators of function, and 
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example, direct use in therapeutic applications or as lead 
compounds for the development of other therapeutic agents, e.g., 
small organic molecules. 

Ill . Advantages of the Invention for the Inhibition of Act ivated 
5 PKC Binding to RACK1 . 

Protein kinase C (PKC) is a family of at least 10 
isozymes that share common structures and biochemical 
characteristics. It has been demonstrated that several isozymes 
are present within a single cell type, and it has been assumed 

10 that individual PKC isozymes are involved in different cellular 
functions. However, so far, the available activators and 
inhibitors of PKC do not appear to be isozyme-specif ic . 
Therefore, it is currently impossible to determine the role of 
individual PKC isozymes in normal cellular functions as well as 

15 in disease. 

PKC activation by, for example, diacylglycerol and 
calcium, induces the translocation of PKC from a soluble 
(cytosolic) to a cell particulate (membrane-associated) 
fraction, as shown in experiments herein (Example 8) . Activated 

20 PKC is stabilized in the cell particulate fraction by binding to 
membrane-associated receptors (receptors for activated C-Kinase, 
or RACKs) . 

In experiments done in support of the present 
invention and described herein, a clone (pRACKl) encoding a RACK 

25 has been isolated (Example 1) . RACK1 belongs to a growing 
family of proteins that are homologous to the fi-subunit of 
transducin and contain the WD-40 motif (Fong, et al.) . It was 
demonstrated that peptide I (SEQ ID NO:l) binds to purified PKC 
(see Example 6 and Fig. 4) , inhibits the binding of PKC to 

30 purified recombinant RACK1 protein (see Example 4 and Fig. 3), 



identified in the sixth WD-40 repeats of RACK1 (see Fig. 1C) . A 
35 synthetic peptide was prepared based on this sequence (peptide 
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corresponding regions in repeats I-V and VII (peptides rl-rV, 
rVII; SEQ ID NO: 2 -6, 8; underlined regions in corresponding 
repeats, Fig. 1C) . Some of the peptides were also found to 
inhibit the binding of PKC to RACK1 (see Example 4 and Fig. 3) . 
5 In addition, some of the peptides were found to bind to purified 
PKC (see Example 6, Fig. 4), partially activate PKC in the 
absence of other activators (peptide rVI; see Examples 7, 10, 11 
and Figs. 5, 8 and 9), and potentiate the effects of known PKC 
activators on the enzyme (see Examples 7-9 and Figs. 5-7). 
10 In Xenopus oocyte maturation studies (see, for 

instance, Example 7), peptide rVI (SEQ ID N0:7) is an agonist of 
0PKC. Peptide rill, while less potent, is also an agonist of 
PKC; it enhances insulin- induced oocyte maturation at 50 and 
500/xM. 

15 In cardiac myocytes, norepinephrine (NE, 2/iM) causes 

translocation of 6 and ePKC isozymes from the cytosolic to the 
particulate fraction. Introduction into cardiac myocytes of 
peptide rill, and to a lesser extent peptide rVI, caused an 
immediate translocation of 6 and ePKC isozymes in the absence of 

2 0 hormone stimulation. This peptide- induced translocation was 
followed by degradation of 6 and ePKC isozymes. Moreover, 
NE-induced translocation is further enhanced in cells containing 
peptide rill. 

In contrast, introduction of peptide I to these cells does 
25 not affect PKC distribution in the absence of hormone 

stimulation, nor does it induce PKC degradation. Furthermore, 
NE-induced translocation is inhibited by peptide I. Similar 
concentrations of a number of control peptides did not affect 
PKC distribution or degradation in control or NE-treated cells. 
30 in studies on rat cardiac myocytes, peptide rill 

induced 6 PKC and ePKC activation that was followed by 
deQradaf i on r^-f hV^co n ---^ 0 -3 ■ , 



— y- c ,■ . in contrast, peptide I (SEQ ID NO:l) inhibited 
hormone -induced translocation of PKC isozymes (Example 8, Fig 6) 
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The data summarized above demonstrate that peptides 
derived from WD-40 repeats of RACK1 can serve as PKC agonists 
and antagonists in vivo, and suggest that peptides derived from 

WD-40 regions of RACK1 contain at least part of the protein- 
5 protein interface between PKC and RACK1 . 

Furthermore, the results suggest that (i) WD-40 
repeats present in other proteins, such as GjS subunit, may also 
be located at or near a surface involved in protein-protein 
interactions, (ii) peptides derived from these repeats may be 

10 effective in disrupting the interactions of the proteins with 
their partners (e.g. ^-adrenergic receptor kinase (/?ARK) , (iii) 
the peptides may modulate or alter the activity of the proteins 
with which the WD-4 0 repeat -containing proteins interact, and 
(iv) the peptides may therefore have specific biological effects 

15 when administered in vivo. 

IV. Identification of Pairs of Interacting Proteins. 
A. Biochemical Approaches . 

Novel interacting proteins may be identified and 
isolated by a number of methods known to those skilled in the 

20 art. For example, monoclonal antibodies raised to a mixture of 
antigens, such as a particular tissue homogenate, may be 
characterized and used to immunoprecipitate a single class of 
antigen molecules present in that tissue. The precipitated 
proteins may then be characterized further, and used to co- 

25 precipitate other proteins with which they normally interact 
(Hari, et al . , Escobedo, et al . ) . 

An alternate method to identify unknown polypeptides 
that interact with a known, isolated protein is by the use of, 
for example, an overlay assay (Wolf, et al . , Mochly-Rosen , et 

30 al . , 1991) . A mixture (such as a fraction of a tissue 



nitrocellulose or nylon membrane, and contacted with a solution 
35 containing the known protein and any necessary co-factors or 
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with a probe for the known protein, for example an antibody or a 
mixture of antibodies, and the signal visualized. 



B . Molecular Approaches . 

Putative binding proteins of a known protein may be 
5 isolated from tissue homogenates, as described above. 

Alternatively, DNA clones encoding putative binding proteins may 
be identified by screening, for example, an appropriate cDNA 
expression library. Expression libraries made from a wide 
variety of tissues are commercially available (for example, from 
10 Clonetech, Palo Alto, CA) . Expression libraries may also be 
made de novo from organisms and tissues of choice by 
practitioners skilled in the art. 

The screening of expression libraries for clones 
expressing a protein or protein fragment of interest may be 
15 readily accomplished using techniques known in the art, for 
example, an overlay assay. 

An overlay- assay screening method may be used to 
identify clones expressing a (known or unknown) protein or 
protein fragment that binds to a probe in hand. The probe may 
20 be a protein postulated to be involved in protein-protein 

interactions with a protein expected to be present in a cDNA 
library selected for screening (as was the case for the cloning 
of RACK1, detailed in Example 1) . 

Actual screening of a selected cDNA library may be 
25 accomplished by inducing plated clones to express cloned 
exogenous sequences, transferring replicas of the induced 
plaques or colonies to filter membranes, and screening the 
membranes with an appropriate probe. According to this method, 
lifts of filters (for example, nylon or nitrocellulose) from an 
3 0 appropriately- induced cDNA library plates (induced by, for 



present on the filters. The filters may then be washed and 
3 5 reacted with a reagent (for example, antibodies such as alkaline 
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Indianapolis, IN) . Additional reactions may be carried out as 
required to detect the presence of bound probe. 

One such overlay assay, described in Example 1, was 
used to screen a rat brain cDNA expression library for proteins 
5 that bind purified PKC in the presence of PKC activators 

(phosphatydilserine, diacylglycerol and calcium) . The filters 
were screened with a mixture of rat brain PKC isozymes (a, /?, 7, 
5, e and f) . Following a series of washes, bound PKC isozymes 
were detected with a mixture of anti-Gf, /?, y PKC mouse 
10 monoclonal antibodies, and anti-6, e and f PKC rabbit polyclonal 
antibodies. Bound antibodies were detected using alkaline 
phosphatase-conjugated goat anti-rabbit or anti-mouse antibodies 
and 5-bromo-4-chloro-3-indoyl phosphate p-toluidine salt as a 
substrate . 

15 Once a clone is identified in a screen such as the one 

described above, it can be isolated or plaque purified and 
sequenced. The insert may then be used in other cloning 
reactions, for example, cloning into an expression vector that 
enables efficient production of recombinant fusion protein. 

20 Examples of appropriate expression vectors are pGEX (Smith, et 
al. # 1988) and pMAL-c2 (New Ei^gland BioLabs, Beverly, MA). An 
expression vector containing an insert of interest may be used 
to transform appropriate host cells, such as E. coli, and the 
transformed host cells can be used to produce the recombinant 

25 protein in large amounts. 

Typically, a recombinant protein is expressed in 
tandem with a bacterial or viral gene product (endogenous 
polypeptide) as part of a fusion protein. The junction between 
the endogenous polypeptide and the recombinant protein typically 

30 includes a recognition site for a rare-cutting protease. The 
endogenous peptide may be designed to incorporate a unique 



35 recombinant protein may then be purified from the fusion protein 
using the appropriate protease. 
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peptides or substances that inhibit binding between the 
recombinant protein and an interacting protein. 

An example of the use of a cDNA clone to express 
protein is detailed in Example 2. RACK1 cDNA, isolated as 
5 described above and in Example 1, was subcloned into an 

expression vector (pMAL-c2, New England BioLabs, Beverly, MA) 
capable of expressing a cloned insert in tandem with maltose- 
binding protein (MBP) . The vector containing the RACK1 insert 
was used to transform TBI K. coli , which were then induced with 

10 IPTG. The cells produced a 78 kDa fusion protein comprised of 
RACK1 fused to the MBP. The overexpressed fusion protein was 
purified on an amylose affinity column according to the 
manufacture's protocol (New England BioLabs, Beverly, MA) and 
incubated with protease Xa to separate the expressed insert from 

15 the MBP. Following the incubation, a 36 kDa RACK1 protein was 
obtained . 



V. Identification of WD-40 Repeats. 

According to a method of the present invention, 
protein-protein interactions can be disrupted and/or the 
2 0 activity of an interacting protein can be altered, given at 

least one of the interacting proteins contains a WD-4 0 motif, or 
region, with a peptide(s) derived from a WD-40 repeat (s) of one 
of the proteins. 

WD-40 repeats are typically found in a family of 
25 proteins having at least a limited homology with the S subunit 
of transducin. WD-40 repeats present in a selected member of 
this family can be identified by (A) performing a self -homology 
analysis on a selected protein using a homology matrix 
(performed by, for example, the computer program DNA Strider 
30 1.2, available from Christian Marck, Service de Biochemie et de 



revealed by the homology matrix analysis, and (C) identifying 
35 conserved amino acid residues that typically serve to define a 
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A. Homology matrix analysis. 

Determining whether a particular amino acid sequence 
contains repeated motifs may be accomplished by a number of 

methods known to those skilled in the art. They range from a 
5 simple visual inspection of the sequence to the use of computer 
programs which can identify repeated motifs. One widely- 
implemented computer-assisted method is to generate a self- 
homology matrix. A self -homology matrix computes the homology 
of each amino acid residue in a particular sequence with pvpry 

10 other residue in that sequence. The homology scores are stored 
in a 2 -dimensional matrix. 

Values higher than a selected criterion level are 
flagged and displayed as points on an x-y coordinate. The x- 
and y-axes correspond to consecutive amino acid positions in the 

15 sequence. 

An example of a self -homology matrix analysis is shown 
in Figure IB. The matrix was generated using the computer 
program DNA Strider 1.2 (Christian Marck, Service de Biochemie 
et de Genetique Moleculaire, Department de Biologie Cellulaire 

20 et Moleculaire, Direction des Sciences de la Vie - CEA - FRANCE) 
with the amino acid sequence of RACK1 (SEQ ID NO: 27) with a 
window setting of 21 and a stringency of 6. Some typical 
features of a self -homology matrix are evident in the figure. 
The graph shows a "primary" diagonal line extending from the 

25 origin with a slope of unity, corresponding to the fact that the 
sequence is identical to itself. If the sequence contains 
repeating elements, as RACK1 does, there will be other, shorter 
sets of contiguous points arranged in diagonal lines 
substantially parallel to the primary diagonal and offset from 

30 the primary diagonal in the x- or y-directions . These shorter 
lines identify the locations of repeating elements with the 



The data displayed in a homology matrix analysis can 
be used to locate and roughly align the sequences of repeating 
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Fig. IB highlights the fact that portions of that region of 
RACK1 , that is, the amino acids between about amino acid 100 and 
amino acid 130, are repeated a total of seven times in the 
sequence of RACK1 . Arrows point to the repeats in the homology 
5 matrix. For purposes of rough alignment, the short diagonal 
lines pointed out by the arrows can be extended to the 
horizontal line at amino acid -100 on the y-axis, and the x-axis 
location corresponding to the intersection be noted. For 
example, the intersection corresponding to the second repeat 

10 (second arrow from the left) is at x=~50) . 

Values determined in this manner may then be used to 
align the amino acid sequence of the repeats with each 
consecutive repeat beneath the preceding one, the start of each 
repeat corresponding approximately to the amino acid position 

15 determined by the analysis in the preceding paragraph. The 

amino acid sequence of RACK1, aligned in this manner, is shown 
in Fig. 1C. 

Most commercially-available DNA and protein sequence 
analysis programs have the capability to perform a self -homology 

2 0 matrix analysis. One example is the program DNA Strider 1.2 
(Christian Marck, Service de Biochemie et de Genetique 
Moleculaire, Department de Biologie Cellulaire et Moleculaire, 
Direction des Sciences de la Vie - CEA - FRANCE) . 

Once the repeating elements are identified and the 

25 sequences corresponding to repeating elements are roughly 

aligned, one may proceed to define the degree of homology among 
the individual repeats at the specific positions within the 
repeats, as is described below. 



B • Aligning amino acid sequences. 
30 If a self -homology matrix was used to obtain a crude 



programs effective to accomplish an alignment include "MACDRAW 
35 PRO" (Claris Corp., Santa Clara, CA) and "WORD" (Microsoft 
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well as IBM-compatible computers running "WINDOWS" (Microsoft 
Corp. ) . 

Amino acid sequences corresponding to internal repeats 
can also be aligned automatically using a protein sequence 
5 analysis program, such as "MACVECTOR" (Eastman Kodak Co., New 
Haven, CT) . 

According to a method of the invention, aligned 
sequences are examined further to determine if they fulfil 
criteria to be defined as WD-40 repeats. These criteria are 
10 detailed in part C, below. 

C . Amino acid residues that define a WD-40 repeat. 

Upon completion of steps outlined in parts A and B 
above, that is, determining whether a particular protein 
contains internal repeats, and if so, aligning those repeats, it 

15 is necessary to determine whether the aligned repeats contain 
WD-4 0 regions. 

A WD-4 0 motif is roughly defined as a contiguous 
sequence of about 25 to 50 amino acids with relatively-well 
conserved sets of amino acids at the two ends (amino- and carboxyl- 

20 terminal) of the sequence. Conserved sets of at least one WD- 
40 repeat of a WD-40 repeat -containing protein typically contain 
conserved amino acids at certain positions. The amino- terminal 
set, comprised of two contiguous amino acids, often contains a 
Gly followed by a His. The carboxyl -terminal set, comprised of 

25 six to eight contiguous amino acids, typically contains an Asp 
at its first position, and a Trp followed by an Asp at its last 
two positions. 

A more accurate definition of a WD-40 motif 
incorporates the observation that while specific residues, such 

30 as those identified above, are not always conserved within a WD- 



In order to better define the class of conserved 
35 residues at selected positions, it is necessary to group amino 
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is to analyze the normalized frequencies of amino acid changes 
between corresponding proteins of homologous organisms (Schulz) . 
According to such analyses, groups of amino acids may be defined 
where amino acids within a group exchange preferentially with 
5 each other, and therefore resemble each other most in their 
impact on the overall protein structure (Schulz) . Examples of 
amino acid groups defined in this manner, some of which are used 
in the definition of a WD-40 motif herein, include: 

(i) a charged group, consisting of Glu and Asp, Lys. Arg 
10 and His, 

(ii) a positively-charged group, consisting of Lys, Arg 
and His, 

(iii) a negatively-charged group, consisting of Glu and 
Asp, 

15 (iv) an aromatic group, consisting of Phe, Tyr and Trp, 

(v) a nitrogen ring group, consisting of His and Trp, 

(vi) a large aliphatic nonpolar group, consisting of Val, 
Leu and lie, 

(vii) a slightly-polar group, consisting of Met and Cys, 
20 (viii) a small-residue group, consisting of Ser, Thr, Asp, 

Asn, Gly, Ala, Glu, Gin and Pro, 

(ix) an aliphatic group consisting of Val, Leu, lie, Met 
and Cys, and 

(x) a small hydroxyl group consisting of Ser and Thr. 

25 In addition to the groups presented above, each amino 

acid residue may form its own group, and the group formed by an 
individual amino acid may be referred to simply by the one 
and/or three letter abbreviation for that amino acid commonly 
used in the art. 

A "WD-40" motif is defined herein as a contiguous set 
of amino acids between (inclusive) two sets of relatively well 
conserved residue?! *-&rrrnA v 0 >~o-;- 



30 



- - '-^ ■ ies^.aue at the rirst position is typically selected 

from groups ii, vi or viii, while the residue at the second 
do s i t i or. i ^ t~ vr i r* ^ 1 i v nr.^-^r,-] r v . 
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respectively. The Gly and His residues are typically present in 
at least one of the aligned repeats of a WD-40-containing 
protein. 

The carboxyl- terminal conserved set typically includes 
5 eight residues, but may contain as few as six residues. The 
most well -conserved residue in WD-40 motifs identified thus far 
is an Asp residue, comprising the first amino acid of the 
carboxyl -terminal conserved set. It is present in virtually all 
WD-40 repeats illustrated herein. In those repeats where it is 
10 not present, the position is occupied by a residue from groups 
iii or Gly. 

The last two amino acids in the carboxyl -terminal 
conserved set are typically selected from groups iv or lie, and 
groups i or viii, respectively. The most commonly used residue 

15 at the first of these positions is Trp. It is typically present 
in at least one of the WD-40 repeats of any given protein. The 
second position is occupied less consistently by a single 
residue, but is often occupied by Asp. The Trp-Asp (WD) 
combination is part of the namesake of WD-40 repeats. 

20 The amino acids present in the internal portion of the 

carboxyl -terminal conserved set are less well -conserved than the 
terminal residues, and their total number may differ by up to 
two residues in different WD-40 repeats. The third position in 
from the carboxyl -terminal end of the carboxyl -terminal 

25 conserved set is typically selected from groups viii or ix, more 
typically ix. The fifth position in from the carboxyl -terminal 
end of the carboxyl -terminal conserved set is also typically 
selected from groups viii or ix, more typically ix. 

The length of a WD-40 repeat, including the amino- 

3 0 terminal and carboxyl -terminal conserved sets is typically 

between about 2 5 and about 50 residues, more typically between 

j~ The number of WD-40 repeats in a particular protein 

can range from two to more than eight. The average number is 
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A determination of whether or not a set of aligned 
internal repeats are WD-40 repeats can be facilitated by an 
examination of all of the repeats as a whole, rather than an 
examination of each repeat individually. This is in part 
5 because not all of the aligned repeats will necessarily contain 
all of the conserved sequences that serve to identify WD-4 0 
repeats, although the conserved residues will typically appear 
in at least one of the repeats. 

For example, Fig. 1C shows the RACK1 amino arid 

10 sequence aligned to illustrate the internal repeats present in 
the sequence. All of the repeats are WD-4 0 repeats, even though 
the amino- terminal conserved set of repeat VI, for instance, 
contains an "LD" as opposed to the more usual "GH" , and the 
carboxyl- terminal conserved set contains a "G" at its first 

15 position, as opposed to the highly- conserved "D" . Similarly, 

the carboxyl -conserved set of, for example, repeat I, contains a 
"WK" at the last to positions, as opposed to the more usual 
"WD" . 

It will be appreciated that certain residues or sets 
20 of residues will be well -conserved in the WD-40 repeats of a 

selected protein, even though they may not be conserved in WD-40 
repeats in general. Such residues or sets of residues may be 
useful in several ways. For example, they may be used in 
performing an alignment of internal repeats in a selected 
25 protein, as described in part B, above. The residues may also 
be useful for identifying regions based on which effective 
binding peptides may be designed (see section VI., below) . 

D. Identification of WD-40 repeats in RACK1 . 

In experiments done in support of the present 
30 invention, a protein that binds to activated PKC was cloned and 



rhe aligned repeats were identified as WD-40 repeats 
35 by application of the criteria identified in parts A, B and C 
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repeats IV, VI and VII , the set consists of other residues. 
These other residues, however, are contained in at least one of 
the amino acid groups identified above as conserved at the 
appropriate position. The conserved carboxyl- terminal set 

5 contains the highly- conserved M D " at its first position in all 
repeats except repeat VI. The second-to-last position of this 
set contains the relatively-well conserved "W" in each repeat, 
while the last position contains the typical "D M in repeats II, 
V and VI, and other residues in the other repeats. 

0 Taken together, these data indicate that the repeats 

contained in RACK1 are WD-40 repeats. The data also illustrate 
that not all repeats contain all of the elements typical of a 
WD-40 motif, but that when the repeats are aligned and viewed 
together as a whole, a WD-4 0 motif is apparent in all repeats. 



15 E - Identification of WD-40 repeats in sequenced proteins. 

Data were compiled in support of the present invention 
to illustrate how WD-40 repeats in various proteins may be 
identified, and to illustrate the diversity of amino acid 
sequences that may be properly identified as WD-40 repeats by 

20 those skilled in the art following the guidance set forth 

herein. Two methods that were used to identify WD-40-containing 
protein sequences are detailed in Example 7 . 

In the first method, proteins identified in their 
description as having a homology to 0- transducin were examined 

25 as detailed in parts B-D, above; for WD-40 repeats. 30 proteins 
were identified in this manner. The amino acid sequences of 
these proteins, with the WD-40 regions aligned and delineated, 
are shown in Figs. 12-18, 20-27, 29-30, 34-35, 37-38, 40 and 42- 
50. The sequences are represented in the Sequence Listing as 

30 SEQ ID NO:29-35, 37-44, 46-47, 51-52, 54-55, 57 and 59-67. 



proteins containing WD-40 repeats were identified with this 
strategy. The amino acid sequences of those proteins, with the 



WO 95/21252 



PCT/US95/01210 



- 27 - 

the Sequence Listing as SEQ ID NO:28, 36, 45, 48-50, 53, 56, 58. 
and 68. 

Other types of searches may be equally effective at 
identifying proteins which may contain WD-4 0 repeats. For 
5 example, on-line databases such as GenBank or SwissProt can be 
searched, either with an entire sequence of a WD-40-containing 
protein, or with a consensus WD-40 repeat sequence. Various 
search algorithms and/or programs may be used, including FASTA, 
BLAST or ENTREZ. FASTA and BLAST are available as a part of the 

10 GCG sequence analysis package (University of Wisconsin, Madison, 
Wisconsin) . ENTREZ is available through the National Center for 
Biotechnology Information, National Library of Medicine, 
National Institutes of Health, Bethesda, MD. 

Sequences identified with a protein homology search 

15 are then analyzed as described in parts A, B and C, above, to 
identify potential WD-40 motifs. Once located, the motifs can 
be aligned, and effective binding peptides may be designed. 

F. Identification of WD-40 regions in novel polypeptides . 
WD-40 repeats may be identified in a novel polypeptide 
20 by, for example, the methods described in parts A-D above. It 
will be appreciated, however, that step A above (homology 
matrix) is not required in the identification of WD-40 repeats. 
Following the guidance of the present invention, one skilled in 
the art may, for instance, identify a WD-40 motif while scanning 
25 the sequence of some, perhaps novel, polypeptide merely through 
a recognition of one or more of the features characteristic of 
WD-40 repeats. 

The precise methods by which one skilled in the art 
arrives at the conclusion that a particular motif is a WD-40 
3 0 repeat is less relevant to the present invention than is the use 



proteins and/or to disrupt protein-protein interactions. 
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Upon the alignment and recognition of WD-40 repeats in 
a particular protein, one may proceed to design a peptide or a 
set of peptides that may be effective to associate with or bind 
to the protein with which the WD-40-containing protein normally 
5 associates. Such a binding or association may be expected to 
alter or modulate the activity of the protein and/or disrupt the 
association of the pair of interacting proteins. 

The sequence of such a peptide will typically be 
homologous, if not identical to, a contiguous amino acid 
10 sequence contained within at least one of the WD-40 repeats. 

Examples of the selection of WD-40-derived peptides effective to 
disrupt protein-protein interactions are detailed in parts C and 
D below, for RACK-PKC and Gj3/7~/3ARK interactions, respectively. 

A. Choosing an appropriate region within a WD-40 repeat. 

15 Putative binding peptides may be selected from any 

portion of a WD-40 repeat. If it is desired to obtain a degree 
of discrimination between the various WD-40 -containing proteins, 
peptides should be chosen from the region between, and not 
including, the amino-terminal and carboxyl- terminal conserved 

20 sets. This "central region" typically shows greater sequence 
diversity between different WD-40-containing proteins than the 
terminal regions, and is roughly outlined by boxes in Figures 
11-51, which show the amino acid sequences and aligned WD-40 
repeats of various WD-40 repeat-containing proteins. Within the 

25 central region, peptides should be selected from sequences that 
have little or no homology to any other known sequences, save 
the sequence (s) of the protein (s) targeted for disruption. 

For example, peptides rill (SEQ ID NO: 4, seven amino 
acids) and rVI (SEQ ID NO:7, eight amino acids), are identical 

30 to segments of RACK1 WD-40 repeats (III and VI, respectively) 



Binding peptides comprise the left portion of the central region 
35 of the respective WD-40 repeats, and are not well -conserved in 
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If it is desired to inhibit the interactions of, for 
example, all of the isoforms of a particular WD-4 0 -containing 
protein family, a sequences is selected that includes a 
significant number of residues that are shared or highly 
5 homologous among at least one WD-4 0 repeat of each of the 
targeted isoforms. 

If, on the other hand, an isof orm-specif ic reagent is 
desired, a sequence is selected from a WD-40 repeat (s) of a 
specific isoform, where that sequence does not include s 
10 significant number of residues that are identical or highly 

homologous to residues in WD-40 sequences from related isoforms. 

B . Choosing an appropriate length for a peptide. 

Effective binding peptides may be designed that range 
in length from as few as about four residues to 4 0 or more 
15 residues. Preferably, binding peptides will have a length of at 
least about six residues, and less than about 2 0 residues. The 
length will be determined in part by the degree of desired 
homology to other WD-4 0 repeats, as described in part A above, 
and by the level of discrimination between proteins that is 
20 required. 

For example, binding peptides selected from RACK1 
sequences to inhibit RACK1/PKC interactions were seven and eight 
amino acids in length. The peptides are long enough to bind 
specifically to the targeted sequences, but short enough to not 
25 cross-react with other WD-40 repeat binding proteins. These 
properties enable the peptides to have very selective and 
specific effects, as is shown below in Examples 6-11. 

C Design of RACK1 WD-40-derived peptides to inhibit 
RACK1-PKC interactions. 



identical to segments of RACK1 WD-40 repeat sequences beginning 
five amino acids in from the amino termini of the WD-40 repeats 
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of the central region of the WD-40 repeats. The peptides were 
tested for their ability to disrupt protein-protein interactions 
in vitro and in vivo, as described in section VII and Examples 
6-11 below. 

5 D. Peptides derived from WD-4 0 repeats of Human G-Beta 

inhibit interactions of G-Beta subunits with 5ARK. 
Methods described in section V part E were used to 
identify WD-40 repeats (SEQ ID NO: 128 -134) in Human G-Beta (SEQ 
ID NO: 41) . Segments from the first six WD-40 repeats were 
10 selected for the design of G-beta binding peptides (SEQ ID 
NO:13-18). The segments were selected based on criteria 
detailed in parts A and B, above. 

The G-beta binding peptides are used to disrupt the 
interactions of G-beta subunits with 0ARK. The disruption is 
15 assayed using a modification of the overlay assay described in 
Example 4 . 

VII . Testing of Putative Binding Peptides. 

Detailed below are several assays by which the 
efficacy of WD-40-derived peptides at binding to a target 
20 protein, inhibiting protein-protein interactions, and altering 
or modulating the activity of a target protein may be 
determined . 

One class of assays, widely-used to assess the binding 
of two proteins to each other, are overlay assays. Overlay 
25 assays are generally applicable to most proteins. They can be 
used to, for example, assess the binding of WD-40-derived 
peptides to their targets, as shown in Example 6 and described 
in part B below. Overlay assays can also be used to assess the 
ability of WD-40-derived peptides to inhibit the binding of two 



Other assays may be used to assess effects of WD-40- 
derived peptides on the activity of the target protein. These 
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the proteins involved and on the system(s) and/or process (es) 
that involve the interacting proteins against which the peptide 
was targeted. For instance, the assays described in parts D-I 
below are appropriate for characterizing PKC activity in vivo 
5 and in vitro. 

While many of the assays below are particularly useful 
for characterizing the activity of PKC, they also illustrate a 
general framework of experiments by which the effects of WD-40 
derived peptides on other proteins may be assessed, 

10 A - Overlay assays to evaluate efficacy of putative 

binding peptides derived from WD-40 regions. 
An overlay assay can be used to assess the disruption 
of the ability of a pair of proteins to associate. Methods for 
conducting overlay assays are well-known in the art (see, for 
15 example, Mochly-Rosen, et al . , 1991). 

Applications of overlay assays to evaluate putative 
binding peptides for PKC/RACK1 interactions are presented in 
Examples 4 and 5 herein. The assays can be generally described 
as follows. 

20 One protein of a pair of interacting proteins 

("immobilized" protein) can be resolved on an SDS/PAGE gel and 
blotted onto an appropriate membrane (for example, 
nitrocellulose or nylon) by methods known to those skilled in 
the art. The blots may then be contacted with a solution 

25 containing the other protein of the pair of interacting proteins 
("overlay" protein) in the presence, and in the absence of 
putative binding peptides. Following appropriate wash steps, 
bound overlay protein can be detected by the use of an 
appropriate probe, such as an antibody directed against the 

30 overlay protein. 



proLein. The modification consists of performing another 
3 5 SDS/PAGE electrophoresis between the steps of binding the 
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pieces sized to just encompass the area occupied by the blotted 
immobilized protein, after the overlay protein had been 
contacted (in the presence or in the absence of binding 
peptides) and allowed to bind to the blot. The pieces of 
5 membrane are then incubated in a sample buffer, placed in the 
wells of a second SDS polyacrylamide gel and subjected to 
electrophoresis . 

Following electrophoresis, the gel is blotted as 
above, and contacted with a probe, for example antibodies, to 
10 detect bound overlay protein. 

B. Binding of flPKC to peptides homologous to a WD-40 
region of RACK1 . 

The binding of 0PKC to peptide I (SEQ ID NO:!), 
peptide rVI (SEQ ID NO: 7) and control peptide (SEQ ID NO: 9) was 
15 assessed in Example 6 using a PKC overlay assay similar to that 
described in Example 3 . Increasing amounts of peptides were 
applied onto nitrocellulose using a slot-blot apparatus. The 
membranes were incubated with PKC in the presence and absence of 
PS, DG, and calcium. 

2 0 The data are shown in Figure 4, and show that 

activated PKC bound to both peptides I and rVI at peptide 
amounts as low as 5 ptmoles, but not to the control peptide. 
Unactivated PKC did not bind to peptide I, but did bind to 
peptide rVI at similar concentrations. 
25 The results indicate that while the peptides were 

homologous to one another and were capable of binding to the 
same protein, they behaved differently. Peptide rVI (SEQ ID 
NO: 7; 8 residues) was able to bind to both activated as well as 
unactivated forms of PKC, whereas peptide I (SEQ ID N0:1; 15 

3 0 residues) could bind only to activated PKC. The differences 



WO 95/21252 



PCTYUS95/01210 



- 33 - 

C. Effects of peptides homologous to WD-4 0 region of 
RACKl on PKC bind ing to RACK1 

Two peptides (peptide rill; SEQ ID NO: 4 and peptide 
rVI; SEQ ID NO: 7) identical to regions of RACKl WD-40 repeats 
5 (underlined, Figure 1C) were tested for their ability to inhibit 
PKC binding to recombinant RACKl using a modification of the 
overlay procedure referred to above. The experiment is detailed 
in Example 4 and the results are shown in Figure 3 . 

Peptide I caused an 81±6% inhibition of PKC binding tn 

10 recombinant RACKl as compared with binding in the absence of 

added peptide. Both peptides rill and rVI inhibited the binding 
of PKC to RACKl. In addition, peptides rl and rll were also 
effective inhibitors of the interaction of PKC to RACKl. A 
lesser inhibitory effect was obtained with peptides rIV and rV 

15 and no inhibition was obtained with peptide rVII. 

The difference in the peptide's ability to inhibit 
binding may reflect differences in the roles played by the 
corresponding WD-4 0 repeats in the protein-protein interactions 
between PKC and RACKl. The peptide's ability or inability to 

20 inhibit protein-protein interactions as assayed by an overlay 
assay, however, is not necessarily correlated with the effects 
those peptides may have on the activity of the targeted 
proteins, as measured by both in vivo and in vitro assays and 
described in parts D-I below. 

2 5 D . Effects of peptides homologous to WD-40 regions of 

RACKl on PKC-mediated oocyte maturation. 
Peptides I (SEQ ID NO:l), rill (SEQ ID NO:4) and rVI 
(SEQ ID NO: 7) were also tested for their ability to affect 
insulin-induced, PKC-mediated maturation in Xenopus oocytes, as 
30 detailed in Example 7 and shown in Figures 5A and 5C. 

DVT* -i irtr^l vo^ - *~ «- V r- or - * 



oocyte maturation m the absence of normones . Exposure to 
3 5 insulin causes an increase in diacylglycerol levels and 
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proteins causes a significant decrease in the rate of oocyte 
maturation (Smith, et al . , 1992). The insulin- induced oocyte 
maturation assay therefore provides an effective in vivo assay 
for compounds that interfere with the function of PKC. 
5 The maturation response was quantified by monitoring 

the appearance of a white spot in the animal hemisphere of the 
oocyte, indicating germinal vesicle breakdown (GVBD) and 
maturation. The indicated peptides were microinjected into 
Xenopus oocytes and the percent of oocytes with GVBD following 
10 insulin exposure was plotted as a function of time in Figures 5A 
and C. 

Approximately 80-85% of sham-injected (control) 
oocytes exposed to insulin reach maturation, as compared with 
45-50% of oocytes injected with peptide I. The rate of 

15 maturation of those oocytes that did mature was similar in the 
two cases. In contrast the effects of peptide I, both peptides 
rill and rVI potentiated the effects of insulin on oocyte 
maturation, both in terms of the rate of maturation, and in the 
total fraction of oocytes that mature during the experiment. 

20 Injection of peptides rill or rVI increases the fraction of 

maturing oocytes to essentially 100%. Furthermore, peptide rVI 
induced oocyte maturation in the absence of insulin stimulation 
(Fig. 5B) . 

Together, the data above indicate that peptides 
25 homologous to the WD-40 region of RACK1 can modulate the 

function of a protein with which RACK1 interacts (e.g. PKC), 
that the modulation can occur in vivo, and that it can have 
clear and profound physiological consequences. Furthermore, the 
results with peptide rVI suggest that under appropriate 
30 circumstances, the peptide alone may act to activate PKC , in the 
absence of other activating substances. 



Insulin causes the redistribution of 0PKC, but not 
3 5 other PKC isozymes, from a cytosolic form to a membrane- 
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To assess the effects of RACK1 WD-40-derived peptides on 
insulin- induced PKC translocation, 50 nl of a 20 mM NaCl 
solution containing the indicated peptides were microinjected 
into Xenopus oocytes. The oocytes were then homogenized, and 
5 the relative amount of PKC in the soluble and particulate 

fractions was assayed. The protocol followed was a modification 
of a method described by Smith, et al (1992) . The results are 
shown in Figure 6 . 

Peptide I (50 jiM) did not affect #PKC distribution in 

10 untreated oocytes, but inhibited insulin- induced /?PKC 

translocation (Fig. 3, lanes 7,8). In contrast, peptide rVI (50 
lM) induced 0PKC translocation in the absence of insulin 
treatment (Fig. 3, lanes 3,4). These results suggest that 
peptide I is an antagonist of hormone- induced PKC translocation, 

15 whereas peptide rVI is an agonist and an activator of PKC 

translocation. In light of the results presented in Example 7, 
the data also suggest that the inhibition of insulin-induced 
GVBD following microinjection of peptide I was due to an 
inhibition of jSPKC translocation. 

20 . F. Effects of peptides homologous to WD-40 regions of 

RACK1 on sensitivity of flPKC to Arg-C endopeptidase. 
Upon activation of PKC, a pseudosubstrate 
autoinhibitory sequence at the N- terminus of PKC dissociates 
from the catalytic site and renders the molecule sensitive to 

25 endopeptidase Arg-C (Orr, et al.). Exposure of activated 0PKC 
to Arg-C results in a limited proteolysis, or "nicking" of the 
enzyme. The nicking typically generates a 78 kDa fragment and 
several small fragments. Continued exposure to Arg-C typically 
results in the disappearance of 0PKC (Orr, et al . ) . 

30 Since peptides rill (SEQ ID NO: 4) and rVI (SEQ ID 

^ manner ic maKe it susceptible to endopept iaase Arg-C. The 
35 experiments are detailed in Example 9 and the results are shown 
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In the presence of effective concentrations of PKC 
activators (0.8 /xg/ml DG, 50 jig/ml PS and 1 mM CaCl 2 ) , exposure 
of 0PKC to Arg-C resulted in nicking, generating the 78 kDa 
fragment (Fig. 7, lane 2). In the absence of PKC activators, 
exposure of 0PKC (80 kDa) to endopeptidase Arg-C had no effect 
on the enzyme (Fig 7, lane 1) . 

Incubation of 0PKC with Arg-C at low concentrations of 
activators (2.5 /zg/ml PS and 50 /xM CaCl 2 ) in the absence of added 
peptide, in the presence of control peptide (SEQ ID NO: 9) and in 
the presence of peptide I (SEQ ID N0:1) did not result in 
appreciable nicking activity (Fig. 7, lanes 4, 8 and 9, 
respectively) . However, incubation of /JPKC with the same low 
concentration of activators in the presence of peptides rill or 
rVI resulted in the appearance of the 78 kDa nicked PKC fragment 
15 (effects of peptide rVI in Fig. 4, lanes 5-7). Concentrations 
as low as 10 nM of peptide rVI were sufficient to result in 
nicking activity, indicative of 0PKC activation. 

The results indicate that peptides rill and rVI, but 
not peptide I, are effective to stabilize PKC in an activated 
conformation that renders it susceptible to Arg-C under 
conditions of low PKC activators that would otherwise not render 
the enzyme susceptible to Arg-C. 

G * Effects of pepti des homologous to WD-4 0 regions of 
RACK1 on flPKC autophosphorvlat ion . 

Activated PKC is capable of autophosphorylation, which 
can be assayed by incubation with [y- 32 p]ATP and visualized on an 
autoradiograph of a gel. Anti-pseudosubstrate antibodies were 
shown previously to induce autophosphorylation in the absence of 
PKC activators (Makowske, et al . ) . Since peptide rVI (SEQ ID 
NO: 7) was effective to induce PKC translocation and GVBD in the 



20 



25 



30 



ana the data are shown in Figure 8 . 
35 PKC activated with PS (50 fig/ml) , DG (0.8 jzg/ml) and 
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(lane 2), or in the absence of PKC activators with peptide I 
(SEQ ID N0:1; lane 5) or control peptide (SEQ ID N0:9; lane 6). 
In contrast, peptide rVI in the absence of PKC activators 
induced PKC autophosphorylation to over 8 0% of the levels 
5 obtained for PKC alone in the presence of optimal concentration 
of PS, DG, and calcium (compare Fig. 8 lane 1 (control) with 
lane 4 (peptide rVI) ) . 

H. Effects of peptides homologous to WD-40 regions of 
RACK1 on histone phosphorylation by flPKC. 

10 Another measure of PKC activity is the ability of 

activated PKC enzyme to phosphorylate histones. PKC 
phosphorylation of histone was carried out using a modification 
of the protocol described by Mochly-Rosen, et al . , (1987) . 
Phosphorylation was carried out in the presence or absence of 

15 PKC activators (PS, DG and calcium) and RACKl-derived peptides. 
Phosphorylated histone was detected by autoradiography, 
following SDS-PAGE on a 10% gel. 

Since peptide rVI (SEQ ID NO: 7) was effective to 
induce the autophosphorylation of PKC in the absence of PKC 

2 0 activators, and both peptides rill (SEQ ID NO: 4) and rVI 

rendered PKC susceptible to proteolysis by Arg-C, experiments 
were performed to characterize the effect of the peptides on 
histone type III phosphorylation by PKC. The experiments are 
detailed in Example 11 and the results are shown in Figures 9 

25 and 10. 

The results are similar to those obtained for the 
effects of peptide rVI on autophosphorylation of PKC, that is, 
peptide rVI was effective to induce PKC-mediated histone 
phosphorylation in the absence of the PKC activators PS, DG, and 
30 calcium, once again supporting that peptide rVI is an agonist of 



WO 95/21252 



PCTAJS95/01210 



- 38 - 

VIII. Utility. 

A. Peptides as probes for the identification of target 
proteins 

WD-40 derived peptides may be used, for example, to 
5 isolate clones encoding target proteins from an expression 

library. Variations on the cloning methods described herein can 
be used to identify clones expressing sequences capable of 
binding the peptides. For example, WC-4 0 derived peptides may 
be used to detect a target protein on a membrane using a 
10 standard binding assay. Positive clones may be detected, for 

example, by radiolabeling the peptides and exposing the membrane 
to film. 

Target proteins isolated in this manner may be 
completely novel, or they may be partially characterized (in 
15 terms of a biological activity in a homogenate, or a band on a 
protein gel, for example). 

Upon isolation of a cDNA encoding a binding protein, 
the cDNA may be expressed, for example, as detailed herein, and 
the protein may be characterized. Purified protein thus 
20 isolated may be used for a number of applications, including the 
production of antibodies. 

Peptides designed according a method of the present 
invention may also be used, for example, as probes in a Western 
blot of a tissue homogenate to identify and determine the 
25 molecular weight of known or putative target proteins. 

Screens such as those described above may be 
facilitated by the modification of peptides used for screening 
to incorporate any of a variety of reporter moieties. For 
example, the peptides can be radiolabeled with 125 I . 
30 Alternatively, the peptides can be modified with a sequence-tag 
or a ligand for an affinity column by methods known to those 

^ various arfinity reagent tor cross linking Known to those 
skilled in the art. This enables the isolation of target 
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B. Peptides as substitutes for defective WD-40 containing 
prot;eing T 

In cases where a WD-40 containing protein is 
implicated in a disease (see, for example Reiner, et al . ) , 
5 peptides derived from WD-40 regions of the defective protein may- 
be used as substitutes, for example, to activate a target 
enzyme. Such an approach may be more feasible than attempting 
therapy with intact proteins. The approach has an additional 
advantage in that it doe*? not require knowledge of the 
10 chromosomal location of the affected gene. 

The peptides can be introduced into affected cells by 
any of several methods known to those skilled in the art, 
including through the use of an appropriate expression vector or 
through in vitro synthesis and administration by an effective, 
15 expedient route. In vitro studies can be carried out using 
skinning or microinjection techniques. 

C . Peptides as pharmaceutical agents. 

WD-4 0 derived peptides of the present invention may be 
used therapeutically, as described above. Such peptides may be 
20 designed so as to interact with endogenous target molecules to 
augment or correct their function. Alternatively, peptides may 
be designed to specifically interact with target molecules 
unique to a pathogenic organism. 

D . Peptides as modulators of enzyme activity of proteins 
25 involved in protein-protein interactions. 

Peptides synthesized according to a method of the 
invention may be effective to modulate the function of a target 
molecule (e.g. serve as agonists or antagonists). As shown 
herein, for example, peptides rVTII and rVI can serve to 



other compounds which may affect the function of target 
molecules such as PKC . In particular, because WD-40 derived 
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useful for identifying molecules or compounds that may interfere 
with PKC function in vivo, but might not necessarily interfere 
with PKC in vitro. 

For example, peptide rVI can be used to stimulate PKC 
5 in the absence of traditional PKC activators, and the rVI- 
stimulated enzyme may be used in a screen to identify, for 
example, novel PKC- inhibiting or PKC-potentiating compounds. 

If constitutive activation or inactivation of a target 
enzyme is desired, peptides may be designed with integrated or 
10 derivatized cross-linking moieties. The peptides can be cross- 
linked to their targets upon binding such that the target 
molecule assumes the desired state of activity for the lifetime 
of the target molecule. 

Conversely, as described herein for PKC, peptides may 
15 also be designed so as to accelerate the degradation of the 
target molecule. For example, peptide rill accelerated the 
degradation of PKC in cardiac myocytes. 

E . WD-40 derived peptides as specific modulators of 
isozymes . 

20 Peptides designed according to a method of the present 

invention can also be used to provide target isozyme-specif ic 
modulator molecules. For example, most cells have several PKC 
isozymes, all of which are activated by the same cellular 
stimuli. Determining the function of the individual isozymes is 

25 therefore difficult. 

WD-40 derived peptides that selectively stimulate or 
inhibit specific target isozymes or groups of isozymes may be 
useful, both in terms of therapeutic value, and in terms of 
determining the roles of different isozymes in cellular function 

30 and disease. Such information can be useful for the 
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F. Compounds designed based on the predicted structure of 
binding peptides as pharmaceutical agents. 
Peptides derived from WD- 4 0 repeats may be useful for 
identifying lead compounds for drug development. Peptides as 
5 small as 7 residues have been shown herein to possess specific 
bioactivities upon interaction with their targets in vivo. The 
structure of such small peptides can be readily determined by a 
number of methods, such as NMR and X-ray crystallography. A 
comparison of the structures of peptides similar in sequence, 
10 but differing in the biological activities they elicit in the 
target molecules, can provide information about the structure- 
activity relationship (SAR) of the target enzyme. 

For example, peptide I and RACKl-derived peptides rill 
(SEQ ID NO:4) and rVI (SEQ ID NO:7) had opposite effect in vivo, 
15 although they are homologous in sequence. 

Information gleaned from the examination of structure - 
activity relationships can be used to design either modified 
peptides, or other small molecules or lead compounds which can 
be tested for predicted properties (e.g. agonist or antagonist), 
20 as related to the target enzyme. The activity of the lead 

compounds can be evaluated using assays similar to those used in 
the evaluation of peptide-binding effects. 

Information relating to a SAR of a target enzyme may 
also be obtained from co-crystallization studies. In such 
25 studies, a peptide with a desired activity is crystallized in 
association with a target protein, and the X-ray structure of 
the complex is determined. The structure can then be compared, 
for example, to the structure of the target protein in its 
native state, and information from such a comparison may be used 
30 to design compounds expected to possess specific activities. 
The compounds can be evaluated using assays similar to those 



35 



mutations in WD- 4 0 containing proteins. 

Results presented herein suggest that the middle 
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is likely to play a central role in the activity of a 
polypeptide complex comprised of interacting proteins, some 
genetic diseases may include mutations at these regions of WD-40 
containing proteins. Therefore, if a WD-4 0 containing 

5 protein is implicated in a genetic disorder, it may be possible 
to use PCR to amplify DNA from the WD-40 regions to quickly 
check if a mutation is contained within one of the WD-40 motifs. 
Primers can be made corresponding to either (i) the flanking 
regions of each repeat or (ii) the flanking regions of a series 

10 of tandem repeats from the affected gene. Standard sequencing 
techniques can be used to determine whether a mutation is 
present. This method does not require prior chromosome mapping 
of the affected gene and can save time by obviating the need to 
sequence the entire gene encoding a defective WD-4 0 protein. 

15 H. WD-40 based polypeptides as affin ity ligands 

Since the polypeptide compositions of the invention 
are able to bind proteins of interest, generically called a 
11 first protein", the polypeptide compositions can also be used 
to retrieve the proteins of interest from samples and the 

20 peptides can be used as affinity ligands for chromatographic 
procedures to purify and analyze said proteins. Standard 
chromatographic techniques are employed. Typically, the 
polypeptide is coupled to a solid support and the sample 
putatively containing the first protein is contacted with the 

25 polypeptide composition of the invention; any unbound components 
of the sample are removed and, if desired, the first protein, 
bound to support, is eluted and recovered. 



I . Use of peptides in screening tests for candidates 
Various candidate compounds, not necessarily 
3 0 polypeptides, may be shown to bind to a first protein using the 



with the polypeptide composition of the invention in the 
3 5 presence and absence of the candidate compound and evaluating 
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the polypeptide in the presence of the candidate indicates that 
the candidate binds to the first protein. 

More broadly, the interaction of a protein with a 
polypeptide subsequence contained in the second protein can be 
5 assessed by contacting the first protein with a polypeptide 

representing the subsequence and observing any interaction with 
the polypeptide composition. 

IX . Production of t he Peptides of the Invention 

The polypeptides of the invention can be prepared 

10 using standard techniques for the synthesis of peptides from 
amino acids. Such techniques, when conducted in solid phase 
chemistry are available commercially. 

The polypeptides of the invention may also be produced 
using recombinant methods. These methods are by now well known 

15 in the art; DNA molecules containing nucleotide sequences 

encoding the desired polypeptides can readily be synthesized and 
ligated into expression systems for production of the peptides 
as is understood in the art. A wide variety of hosts is 
available, including procaryotic and eucuryatic hosts. The 

20 construction of expression vectors, means to modify these hosts, 
and culturing the modified hosts for recombinant production of 
polypeptides are conducted using standard techniques. 

The following examples illustrate, but do not limit 
the present invention. 

2 5 Materials and Methods 

Nitrocellulose filters were obtained from Schleicher and 
Schuell (Keene, NH) . 

Synthetic peptides were prepared using commercially 
available automated peptide synthesizers. Alternatively, custom 



peptides. The oligonucleotide sequences may be either synthesized 
directly by standard methods of oligonucleotide synthesis, or, in 
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fragments corresponding to the coding sequence (Crea; Yoshio, et 
al.; Eaton, et al . ) . Oligonucleotide coding sequences can be 
expressed by standard recombinant procedures (Maniatis, et al . ; 
Ausubel , et al.) . 

5 "Triton" refers to a nonionic detergent comprising a 

polyoxyethylene ether and other surface-active compounds. An 
exemplary Triton detergent is "TRITON X-100", available from Sigma 
Chemical Company, St. Louis, MO. 

"Tween" refers to a nonionic detergent comprising 
10 polyoxyethylenesorbitan monolaurate with a fatty acid composition 
of approximately 55% lauric acid, with a balance composed primarily 
of myristic, palmitic and stearic acids. An exemplary Tween 
detergent is " TWEEN 20", available from Sigma Chemical Company, St. 
Louis, MO. 

15 "SDS" refers to sodium dodecyl sulfate. 

"PAGE" refers to polyacrylamide gel electrophoresis. 
"IPTG" refers to isopropyl S-D-thiogalactopyranoside . 

Example 1 

Expression Cloning of a PKC-bindinq Protein 
20 A. Buffers. 

Overlay block buffer: 50 mM Tris-HCl (pH 7.5), 0.2 M 
NaCl, 3% bovine serum albumin (BSA) and 0.1% polyethylene glycol. 

Overlay buffer: 50 mM Tris-HCl (pH 7.5), 0.2 M NaCl, 12 
mM 2 -mercaptoethanol , 0.1 % BSA, 1% polyethylene glycol, 10 /xg per 
25 ml soybean trypsin inhibitor and lO^xg per ml leupeptin. 

B . Isolation of a PKC-bindinq cDNA clone bv an overlay 
assay. 

A rat brain (Sprague Dawley) cDNA expression library, 
constructed in the lambda phage cloning vector "UNI -ZAP XR" 

1 n t Qf r^*-qrrpno T ~> T^l ~\ \ . T ~ • • 1 1 

xiorary piates were incubated for 2 hours in overlay block buffer. 
The filters were then transferred to overlay buffer with or without 
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at room temperature with PKC activators (60 M9/ ml 
phosphatidylserine (PS), 2 fig/ml diacylglycerol (DG) , 1 mM CaCl 2 ) 
Following three 15 minute washes in the overlay buffer, 
the filters were incubated in the overlay block buffer in the 
5 presence of a mixture of monoclonal anti-a, 0 and y PKC antibodies 
(1:1000 dilution; Seikagaku Kogyo, Tokyo, Japan) and polyclonal 
anti-6, e and f PKC antibodies (1:500 dilution; Life Technologies, 
Gaithersburg, MD) . After a 16 hr incubation at room temperature, 
the filters were washed three times, 15 minutes per wash, in 
10 overlay buffer. 

Binding of PKC was determined using alkaline phosphatase- 
conjugated goat anti-rabbit or goat anti-mouse antibodies (1:2000 
dilution, Boehringer Mannheim Biochemicals, Indianapolis, IN). The 
alkaline phosphatase reaction used 5-bromo-4-chloro-3 -indoyl 
15 phosphate p-toluidine salt as a substrate, and was performed 
following the manufacturer's protocol. 

Library screening of 2.4 x 10 6 recombinant "UNI -ZAP" 
lambda phage plaques yielded one clone, pRACKl, that reacted with 
ant i- PKC antibodies in the PKC overlay membrane, but not in the 
20 control overlay membrane. These results suggest that pRACKl 
encodes a PKC binding protein. 

i 

C . Cloning and sequencing cDNA from positive plaques. 

The clone pRACKl, identified as detailed in part B above, 
was plaque purified and cDNA inserts were isolated as phagemids by 
25 in vivo excision of the cloning vector, according to the 
manufacture's protocol (Stratagene, La Jolla, CA) . DNA sequencing 
of pRACKl was carried out using standard di-deoxy sequencing 
techniques (Maniatis, et al . ) The DNA sequence of RACK1 is shown 
in Figure 1A. The sequence is also contained in the Sequence 
30 Listing as SEQ ID NO: 19. 



coli 

A Pstl/Xhol DNA fragment containing an open reading frame 
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upstream of the initiating methionine was subcloned into E. coli 
expression vector pMAL-c2 (New England BioLabs, Beverly, MA) . This 
vector contains the malE gene, which encodes maltose-binding 
protein (MBP) . Induction of E. coli containing the vector results 
5 in the production of an MBP-fusion protein (Ausubel, et al . ) . The 
vector also includes a recognition site for the protease factor Xa, 
which allows the protein of interest to be cleaved from MBP after 
purification without adding any vector-derived residues to the 
protein. 

10 A culture of TBI E. coli transformed with RACK1- 

containing pMAL-c2 was induced by a 3 hr incubation with 1.8 mM 
IPTG. A protein fraction containing a 78 kDa fusion protein, 
comprised of RACK1 fused to MBP was isolated from the cultured E. 
coli by standard methods (Ausubel) . The fusion protein was 

15 purified on an amylose affinity column according to the 
manufacture's protocol (New England BioLabs, Beverly, MA) and 
incubated with protease Xa (New England BioLabs) to yield a 36 kDa 
protein (RACK1) and a 34 kDa protein (possibly a RACK1 degradation 
product) . 

20 Example 3 

Binding of PRC to Recombinant RACK1 

A. Buffers . 

PBS/Tween buffer: 140 mM NaCl, 8 mM Na 2 P0 4 , 1 . 5 mM KH 2 P0 4 , 
3 mM KC1 and 0.05% Tween at pH 7.0, 
25 Overlay wash buffer: 50 mM Tris-HCl (pH 7.5) , 0.2 M NaCl, 

12 mM 2-mercaptoethanol , 0.1% polyethylene glycol and 0 . 1 mM CaCl 2 . 

B. Overlay assay. 

Purified recombinant RACK1 protein (100-250 /xg per lane, 
produced as detailed in Example 2) was subjected to SDS/PAGE and 



of a mixture of PKC isozymes (a, (3 , y, b, e and f, -10 nM each 
final concentration) and PKC activators (60 M9/ m l 
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in overlay wash buffer. Where indicated, PKC activators were 
present during the incubation of PKC with the nitrocellulose 
strips. The conditions for each sample and corresponding results 
are presented in part D below. 

C, Detection of bound PKC. 

PKC bound to RACK1 immobilized on nitrocellulose strips 
was detected as follows. The strips were incubated for 16 hours 
at room temperature with a mixture of anti-PKC antibodies as 
detailed in part B of Example 1, and then washed three times, 15 
minutes per wash, with PBS/Tween buffer. The strips were incubated 
with anti-mouse and anti-rabbit horseradish peroxidase -linked 
secondary antibodies (Amersham Lif e Science, Arlington Heights, IL) 
diluted 1:1000 in PBS/Tween buffer supplements with 2% BSA, for 1 
hour at room temperature. After washing three times, 15 minutes 
per wash with PBS/Tween buffer, the strips were subjected to a 
chemiluminescent reaction with luminol (diacylhydrazide) as 
detailed in the mauf acturer ' s protocol (Amersham Life Science, 
Arlington Heights, IL) , followed by an immediate exposure to 
autoradiography film (Eastman Kodak, Rochester, NY) for 30 seconds 
to 5 minutes . 

D. Effects of PKC activation on PKC binding to RACK1 . 

The results presented in Figure 2 show the influence of 
PKC activators on the binding of PKC to RACK1 immobilized on 
nitrocellulose membranes. The overlay assay was carried out as 
described in part B above. The test reagents contained in each 
sample and the corresponding lanes on the blot presented in Fig. 
2 are as follows. Lane 1: PKC, 60 jig/ml PS, 2 /xg/ml DG and 1 mM 
CaCl 2 ; lane 2: PKC and 1 mM EGTA; lane 3: PKC, 60 /ig/ml PS and 2 
fig/ml DG; lane 4: PKC and 1 mM CaCl 2 ; lane 5: No PKC added; lanes 
6 and 7- PKC nrr/m"! vq o .» ~ . — 



-nr tt ^:.uepe::aenL experiments . 

It can be appreciated that the binding of PKC as detected 



WO 95/21252 



PCT/US95/01210 



- 48 - 

presence of phosphatidylserine (PS) and diacylglycerol (DG; lane 
3), and is maximal in the presence PS, DG and calcium (lane 1) . 
Antibody binding was not observed in the absence of added PKC (lane 
5) . Furthermore, maltose binding protein alone, or an extract from 
5 non-transformed E. coli did not bind PKC. 

The concentration dependence of PKC binding to RACKl was 
characterized with /3PKC, since this isozyme is a major component 
of the PKC mixture used for the overlay assay. The mean half 
maximal binding was -0.375 nM, and maximal binding was -4 nM (n=3; 
10 values reflect binding of jSPKC isozyme in the presence of other PKC 
isozymes and was determined by scanning autoradiograms in the 
linear range of detection, as described in Mochly-Rosen, et al . , 
(1991) . 

The results presented above indicate that in order for 
15 PKC to bind to RACKl it must be activated. In vitro, activation 
may be accomplished, for example, by phosphatidylserine and 
diacylglycerol, or, more preferably, by phosphatidylserine, 
diacylglycerol and calcium. 

Example 4 

20 Inhibition of PKC Binding to RACKl by RACKl-specif ic WD-40- 

homoloaous Peptides 
Assays for the inhibition of PKC binding to RACKl by 
putative binding peptides were carried out by combining a variation 
of the overlay protocol described in Example 3 part B above, with 

25 an overlay extraction assay described in part B below. The 
variation in the overlay protocol consisted of incubating the 
putative binding peptides with a mixture of PKC isozymes for 15 
minutes at room temperature before the mixture was used to contact 
the nitrocellulose strips containing immobilized RACKl. 
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B . Overlay extraction protocol . 

Nitrocellulose strips containing immobilized RACK1, that 
had been contacted with a solution containing a mixture of PKC 
isozymes, were washed and the area corresponding to the 36 kDa 
5 (RACKl-containing) band was cut out. The pieces (containing 
PKC/RACK1 complexes) were incubated with sample buffer for 10 
minutes at 80°C. The sample buffer and the nitrocellulose pieces 
were then placed in wells in the PAGE gel and subjected to SDS-PAGE 
to elute the bound proteins. The gel was blotted onto 
10 nitrocellulose and a Western blot analysis was carried out using 
the mixture of antibodies (specific for PKC a, /J, y, 5, e and f 
isozymes) described in Example 1 part B. Bound antibodies were 
detected by 125 I-protein A. 

C. PKC overlay in the presence of binding peptides. 

15 Peptides derived from or homologous to WD-4 0 repeats of 

RACK1 were tested for their ability to inhibit PKC binding to 
recombinant RACK1 . Binding of PKC to RACK1 was carried out using 
a variation of the overlay procedure described in Example 3 part 
B. In the experimental samples, peptides were incubated with a 

20 solution containing a mixture of rat brain PKC isozymes (-10 nM 
each) for 15 minutes at room tetnperature . 

Following completion of the modified overlay protocol, 
the samples were subjected to the overlay-extraction protocol 
detailed in part B, above. 

25 The results in Figure 3 show the binding of PKC to RACK1 , 

carried out without (lane 1) or with (lanes 2-4) a preincubation 
of peptides with PKC. Lane 2 shows PKC binding following a 
preincubation with 10 fM peptide I (SEQ ID N0:1) . Peptide I caused 
an 81+6% inhibit ion of PKC binding to recombinant RACK1 as compared 

3 0 with binding in the absence of added peptide (n=3) . Lanes 3 and 

- t can De seer, tnat peptide rl II is somewhat more effective than 
35 peptide rVI . The results shown are representative of three 

- ^, d ri n ^ ^ O ^ *" O "\ r ^ **■ " - "i *~ " 
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The overlay-extraction method (part B above) was used in 
experiments relating to the peptide inhibition of PKC binding in 
order to decrease the possibility that some part of the inhibition 
of PKC binding to RACK1 reflects an interference in the binding of 
5 anti-PKC antibodies to the PKC/RACK1 complexes. Free peptides are 
effectively removed from the PKC/RACK1 complexes during the second 
round of SDS/PAGE, prior to blotting and detection of immobilized 
PKC/RACK1 complexes by anti-PKC antibodies. 

Example 5 

10 Identification of Sequenced Proteins Containing WD-40 Repeats 

A search for WD-40 motif -containing proteins was done 
using the ENTREZ program, release 6.0 (National Center for 
Biotechnology Information, National Library of Medicine, National 
Institutes of Health, Bethesda, MD) . The ENTREZ database was 

15 searched for protein sequences related to the 0 subunit of 
transducin . 

Protein sequences homologous to /?- transducin were 
examined for the existence of WD-40 repeats, following the guidance 
for identification of WD-40 repeats presented in section V of the 
20 specification, above. 

The proteins were ; also used to carry out additional 
searches of the database, in order to identify other proteins which 
may contain WD-4 0 repeats, but which might not be homologous to the 
/? subunit of transducin. Sequences identified during the second 
25 round of searches were again examined for WD-40 repeats. 

This search strategy identified 3 0 proteins containing 
WD-40 sequences. The amino acid sequences of these proteins, with 
the WD-40 regions aligned and delineated, are shown in Figs. 12-18, 
20-27, 29-30, 34-35, 37-38, 40 and 42-50. The sequences are 
30 represented in the Sequence Listing as SEQ ID NO: 29-35, 37-44, 46- 



pattern can be inferred based on the teachings presented in part 
35 V of the specification above. 



WO 95/21252 



PCTAJS95/01210 



- 51 - 

(Eastman Kodak Co., New Haven, CT) to search GenBank (December 1993 
release) . Default settings (matrix=250) were used for the search. 
The search identified the 250 proteins with the highest homology 
to the consensus sequence. These proteins were examined, as 
5 detailed in part V above, for WD-4 0 repeats. Ten additional 
proteins containing WD-40 repeats were identified with this 
strategy. The amino acid sequences of those proteins, with the WD- 
40 repeats aligned and delineated, are shown in Figs. 11, 19, 28, 
31-33, 36, 39, 41 and 51. The sequences are represented in the 
10 Sequence Listing as SEQ ID NO:28, 36, 45, 48-50, 53, 56, 58 and 68. 

Example 6 

Binding of flPKC to RACK1 WD-4 0 -derived Peptides 

A. Buffers . 

Peptide overlay block buffer: 20 mM Tris-HCl (pH 7.5), 
15 0.2 M NaCl, 3% bovine serum albumin (BSA) and 0.1% polyethylene 
glycol . 

Overlay wash buffer: 50 mM Tris-HCl (pH 7.5) , 0.2 M NaCl, 
12 mM 2-mercaptoethanol, 0.1% polyethylene glycol and 0 . 1 mM CaCl 2 . 

B . PKC overlay of immobilized peptides. 

20 The binding of 0PKC to peptide I (SEQ ID NO:l), peptide 

rVI (SEQ ID NO: 7) and control peptide (SEQ ID NO: 9) was assessed 

using a PKC overlay assay similar to that described in Example 3. 
Increasing amounts of peptides (0.5 fimole, 1.0 jxmole, 5.0 /imole and 
10.0 ^mole) suspended in 20 mM NaCl were applied individually onto 

25 nitrocellulose using a slot-blot apparatus (Schleicher and Schuell, 
Keene, NH) . The nitrocellulose membrane was washed three times, 
15 minutes per wash, in peptide overlay buffer and incubated for 
two hours in peptide overlay block buffer. The membrane was cut 
into sections and the sections were transferred to different PKC- 

30 containina solution^ ^nH ^ ... - 

-u ( ai.a ^a.cium, The memoranes were tnen wasnea three times, 15 
minutes per wash, in peptide overlay buffer and incubated in 
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a 16 hr incubation at room temperature, the filters were washed 
three times, 15 minutes per wash, in peptide overlay buffer. 

Binding of PKC was determined using chemiluminescence as 
described in Example 3, part C. Quantitation of PKC binding was 
5 carried out using a "MICRO SCAN" 1000 gel analyzer (Galai Inc., 
Yokneam, Israel) . 

The data show that activated PKC bound to both peptides 
I and rVI, but not to the control peptide, at peptide amounts as 
low as 5 /xmoles. Unactivated PKC did not bind to peptide I, but 
10 did bind to peptide rVI at similar concentrations. 

The results indicate that peptide rVI is capable of 
binding both activated as well as unactivated forms of PKC, whereas 
peptide I binds only to activated PKC. 

Example 7 

15 Effects of RACK1 WD-40-derived Peptides on PKC-mediated Oocyte 

Maturation 

Exposure to insulin induces maturation in Xenopus oocytes 
via a PKC-dependent pathway (Smith, et al . , 1992) . The maturation 
response may be quantified by monitoring the appearance of a white 

2 0 spot in the animal hemisphere of the oocyte, indicating germinal 

vesicle breakdown (GVBD) and maturation. To assess the effects of 
RACK1 WD-4 0 -derived peptides on insulin- induced PKC-mediated 
maturation, 50 nl of a 20 mM NaCl solution containing the indicated 
peptides [peptide I (SEQ ID NO:l; •) , peptide rVI (SEQ ID NO: 7; ■) , 
25 or injection solution (□) ] (peptides at 50 /iM) were microinjected 
into Xenopus oocytes. The symbols refer to symbols used in Figure 
5, which shows the data from this example. One hour following the 
peptide injections, the oocytes were exposed to a solution 
containing insulin (8.25 /ig/ml) for 2 minutes (t=0) . 10-15 oocytes 

3 0 were used for each sample. 



m Figure 5 . 

35 In oocytes injected with buffer or control peptide, onset 
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time-course, reaching a plateau of about 85-90% GVBD at about 10-12 
hours. These data indicate that approximately 80-85% of sham- 
injected oocytes exposed to insulin at t=0 reach maturation, and 
that maturation is reached relatively quickly (within about 10 
5 hours) relative to the time-course of the experiment (20 hours) . 

Oocytes injected with peptide I (SEQ ID NO:l) responded 
in a manner similar to control oocytes, except the plateau was at 
about 45-50% GVBD. These data suggest that injection of peptide 
I blocked maturation in approximately 40-45% of oocytes that would 

10 normally proceed to maturation, but had little effect on the 
kinetics or extent of maturation of the remaining (50-55%) oocytes. 

Oocytes injected with peptide rVI (SEQ ID NO: 7) responded 
with a slightly shorter delay (about 3-4 hours), but reached a 
higher plateau (about 95-100% GVBD) more quickly (within about 5 

15 hours) than control oocytes. These data suggest that peptide rVI 
potentiates the effects of insulin on oocyte maturation, both in 
terms of the rate of maturation, and in the total fraction of 
oocytes that mature during the experiment. Injection of peptide 
rVI increases the maturing fraction to essentially 100% 

20 The effects of both peptides I and rVI on GVBD were dose- 

dependent between 5 /xm-50 0 /xM. 

Since peptide rVI enhanced insulin- induced GVBD, 
experiments were performed to determine whether peptide rVI can 
induce GVBD in the absence of insulin. The data from these 

25 experiments are shown in Fig. 5B . Microinjection of peptide rVI 
(50 jiM) alone, but not peptide I, control peptide or buffer, 
induced GVBD. Maturation initiated with a longer delay (about 6-7 
hours) than in the control insulin- induced oocytes in Fig. 5A 
(about 4-5 hours) , and reached a plateau of about 50% GVBD. 

30 Together, the data above indicate that peptides 

homologous to the WD-40 region of RACK1 modulate the function of 
PKC. Per>Hr3p T inbi^^o^ v>vn ^^.i ^ 



insuiin. The latter result suggests that peptide rVI , under 
appropriate circumstances, may act to activate PKC in the absence 
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Example 8 

Effects of RACK1 WD-4 0 -derived Peptides on PKC Translocation in 

Xenovus Oocytes 

A. Buffers . 

5 Homogenization buffer: 20 mM Tris HCl, pH 7.5, 10 mM 

EGTA, 2 mM EDTA, 0.25M sucrose, 10/xM phenylmethylsulf onyl fluoride, 
2 0/xg/ml of each leupeptin and soybean trypsin inhibitor. 

B. PKC translocation in oocytes. 

Insulin causes the translocation of /?PKC, but not other 

10 PKC isozymes, from a cytosolic form to a membrane -associated form, 
as evidenced by the relative levels of PKC in the soluble vs. the 
particulate fraction of oocyte homogenate. To assess the effects 
of RACK1 WD-40-derived peptides on insulin- induced PKC 
translocation, 50 nl of a 20 mM NaCl solution containing the 

15 indicated peptides were microinjected into Xenopus oocytes. The 
oocytes were then homogenized, and the relative amount of PKC in 
the soluble and particulate fractions was assayed. The protocol 
followed was a modification of a method described by Smith, et al . 
(1992). The results are shown in Figure 6. 

20 Batches of 50 oocytes were microinjected with either 

peptide rVI (SEQ ID NO:7; So pM; lanes 3, 4), peptide I (SEQ ID 
NO:l; 50 /xM, lanes 7, 8) or injection solution (NaCl 20 mM, lanes 
1,2 and 5,6). Homogenates from each batch were prepared 60 
minutes after microinjection (lanes 1-4) or 60 minutes after 

25 addition of insulin (lanes 5-8) . The homogenates were centrifuged 
at 10,000 g for 3 minutes, the upper layer (containing fat and 
yolk) was removed, and the remainder was frozen at -70 °C. Prior 
to use, the samples were thawed, 200 /il homogenization buffer was 
added and the samples were centrifuged at 100,000 g for 30 minutes 

30 at 4 °C. The supernatants (soluble fraction) were removed and 



resolved on an 8% SDS/PAGE gel and blotted onto nitrocellulose. 
3 5 The amount of PKC in each fraction was determined by Western blot 
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Tokyo, Japan) . Bound primary antibodies were detected by 
chemiluminescence as described in Example 3, part C. 

The antibodies showed immunoreactivity with an -80 kDa 
protein that corresponds to 0PKC. Data are representative of three 
5 experiments. 

The data are shown in Figure 6. Lanes 1, 3, 5 and 7 
contain particulate fractions (p) , while lanes 2, 4, 6 and 8 
contain soluble (cytosol) fractions (c) . Peptide I (50 pM) did not 
affect 0PKC distribution in untreated oocytes, but inhibited 
10 insulin-induced /3PKC translocation (Fig. 3, lanes 7,8). In 
contrast, peptide rVI (50 pM) induced 0PKC translocation in the 
absence of insulin treatment (Fig. 3, lanes 3,4). 

The results above suggest that peptide I is an antagonist 
of insulin-induced PKC translocation, whereas peptide rVI is an 
15 agonist and an activator of PKC translocation. In light of the 
results presented in Example 7, the data also suggest that the 
inhibition of insulin- induced GVBD following microinjection of 
peptide I was due to an inhibition of j3PKC translocation. 



20 



Example 9 

Effects of RACK1 WD-40-derived Peptides on Sensitivity of PKC to 

Arg-C Endopeptidase 

A. Buffers . 

Sample buffer: 0.3 M Tris HC1, 5% SDS, 50% glycerol, 
0.01% bromophenol blue and 5% /3-mercaptoethanol . 

2 5 B . Nicking of flPKC by Arg-C endopeptidase. 

Upon activation of PKC, a pseudosubstrate autoinhibitory 
sequence at the N-terminus of the molecule dissociates from the 
catalytic site and becomes sensitive to endopeptidase Arg-C (Orr, 
et al.) . In the absence of PKC activators, exposure of the 80 kDa 

30 flPKC to endopept i dapp Ara- r hx* ^ -~ 



rr^ ^lmitea proteolysis generating a 76 KDa fragment 

and several small fragments (see Fig. 7, lane 2)). Continued 
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the WD-40 region of RACK1 alter the sensitivity of 0PKC to 
endopeptidase Arg-C. 

The methods used to assay Arg-C sensitivity are a 
modification of methods described by Orr, et al . Rat brain PKC (~ 
5 5 nM) was incubated at room temperature in 500 fil of 20 mM Tris-HCl 
buffer (pH 7.5) alone or with Arg-C (5 units/ml) in the presence 
or absence of the indicated peptides (final concentration 10 jiM or 
as indicated) , PS, DG, and calcium (as indicated) . 50 fil aliquots 
were removed into 20 ul of sample buffer during the reaction as 

10 indicated (samples in all the lanes were incubated for 30 minutes 
, except lanes 5, and 6, which were incubated for 5 and 15 minutes, 
respectively) . The samples were boiled for 10 minutes at 80 °C and 
loaded onto 8% SDS-PAGE. 0PKC was detected by Western blot 
analysis using anti-#PKC antibodies as described in Examples 6 and 

15 8 . 

The results are shown in Figure 7 . PKC was incubated for 
the indicated time alone (lane 1) or in the presence of Arg-C 
(lanes 2-9), with DG (0.8 /zg/ml) , PS (50 fig/ml) and CaCl 2 (1 mM; 
lane 2), with PS (50 /ig/ml) and CaCl 2 (1 mM; lane 3), with PS (2.5 

20 fxg/ml) and CaCl 2 (50 fiM; lane 4); with PS (2.5 /zg/ml) , CaCl 2 (50 /xM) 
and with either peptide rVI (SEQ ID N0:7; 10 /zM; lanes 5-7), 
control peptide (SEQ ID NO: 9; iane 8) or with peptide I (SEQ ID 
NO:l; lane 9) . 

Incubation of /3PKC with Arg-C at low concentrations of 

25 activators (2.5 /xg/ml PS and 50 jxM CaCl 2 ) in the absence of added 
peptide did not result in appreciable nicking activity (Fig. 7, 
lane 4) . Similarly, nicking of /3PKC did not occur in the presence 
of this concentration of activators with peptide I (lane 9) or with 
control peptide (lane 8) . However, incubation of 0PKC with the 

30 same concentration of activators in the presence of peptide rVI 
resulted in a time -dependent appearance of the 78 kDa nicked PKC 



35 rVI , but not peptide I, is effective to stabilize PKC in an 
activated conformation that renders it susceptible to Arg-C under 
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Example 10 

Effects of RACK1 WD-40-derived Peptides on PKC 
Autophosphorylation 

Activated PKC is capable of autophosphorylation . Since 
peptide rVI (SEQ ID NO: 7) was effective to induce PKC translocation 
and GVBD in the absence of an activator such as insulin, the 
ability of the peptide to induce PKC autophosphorylation in the 
absence of PKC activators was assessed. 

PKC autophosphorylation in the presence of /?PKC 
pseudosubstrate antibodies or the indicated peptides was carried 
out using a modification of the method described by Makowske, et 
al . Anti-pseudosubstrate antibodies, which were shown previously 
to induce autophosphorylation in the absence of PKC activators 
(Makowske, et al.) were used as a positive control. The results 
are shown in Figure 8 . 

Rat brain PKC (~ 10 nM) was incubated with mild agitation 
in a final volume of 250 fil of overlay buffer, as in Example 1 
either with anti-/JPKC pseudosubstrate antibodies (1:10 dilution, 
Life Technologies, Gaithersburg, MD) or with the indicated peptide 
(10 fjM) . Where indicated, PS (50 ng/ml) , DG (0.8 jig/ml) and CaCl 2 
(1 mM) were also added. The amount of autophosphorylation was 
determined after 2 hours for the reaction with the anti- 
pseudosubstrate antibodies, or after 15 minutes for the other 
samples. 50 /xl of a buffer comprised of 20 mM Tris-HCl (pH 7.5), 
20 mM MgCl 2 , 20 /iM ATP and 5 /xci/ml [7~ 32 P]ATP. The mixture was 
incubated for 15 minutes at room temperature and the reaction was 
stopped by adding 60 /xl sample buffer (see Example 9) . The samples 
were then boiled for 10 minutes, loaded onto a 10% SDS-PAGE mini 
gel and electrophoresed. The gel was fixed with 50% methanol and 
10% acetic acid for 1 hour, and the autophosphorylation of PKC was 
determined by autoradiography. 

The result p i ^ pian^ Q riVi ^' r>v~ ■■ * ~ , 



aixiDcaies vai^utea i:10 in 20 mM Tris-HCl; iane 3), in the 
presence of peptide rVI (SEQ ID NO : 7 ; 10 /zM; lane 4), in the 
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Peptide rVI in the absence of PKC activators induced PKC 
autophosphorylation to over 80% of the autophosphorylation obtained 
in the presence of optimal concentration of PS, DG, and calcium 
(compare Fig. 8 lane 1 (control) with lane 4 (peptide rVI) . 
5 Neither peptide I nor control peptide induced PKC 
autophosphorylation in the absence of PKC activators (Fig. 8 lanes 
5 and 6, respectively) . 

Example 11 

Effects of RACK1 WD-40-derived Peptides on Histone 

10 Phosphorylation by PKC 

Incubation of PKC with peptide rVI (SEQ ID NO: 7) induced 
histone phosphorylation by PKC. The method used was a modification 
of the protocol described by Mochly-Rosen, et al . (1987). The 
results are shown in Figure 9. 

15 Histone type Ills (Sigma Chemical Company, St. Louis, MO) 

was phosphorylated by PKC (- 10 nM) in the absence (lane 1) and 
presence of peptide rVI (10 jiM) (lanes 2 and 3) and in the presence 
and absence of DG (0.8 /ig/ml) , PS (50 jig/ml) and CaCl 2 (1 mM) 
(lane 3) . The results are expressed as percentage of control that 

2 0 is the amount of Histone phosphorylation by PKC in the presence of 
DG (0.8 /xg/ml) , PS (50 jig/ml) and CaCl 2 (1 mM) . The results are 
the average ± SEM of two independent experiments. PKC was first 
incubated with the peptide rVI (10 jiM) for 15 minutes in overlay 
buffer as described above. Histone type Ills (40 fig /ml) was added 

25 in Tris-HCl (20 mM) , MgCl 2 (20 mM) , ATP (20 /zM) and [y- 32 P]ATP (5 
/xci/ml) with or without PS (50 /ig/ml) , DG (0.8 /zg/ml) and CaCl 2 (1 
mM) . Histone phosphorylation was determined by autoradiography as 
above . 

PKC activators PS, DG, and calcium were not required for 
30 either peptide rVI-induced autophosphorylation or histone 



Ills (25/iM) by PKC (10 nM) was not inhibited by RACK1 ; rather, a 
35 4.5±0.1 fold increase of histone phosphorylation occurred when co- 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: Mochly-Rosen, Daria 
Ron, Dor it 

(ii) TITLE OF INVENTION: WD-40 - Derived Peptides and Uses 
10 Thereof 

(iii) NUMBER OF SEQUENCES: 265 

(iv) CORRESPONDENCE ADDRESS: 
15 (A) ADDRESSEE: Dehlinger & Associates 

(B) STREET: P.O. Box 60850 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: USA 

20 (F) ZIP: 94306-0850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/190,802 
30 (B) FILING DATE: 01-FEB-1994 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 
(A) NAME: Fabian, Gary R. 
3 5 (B) REGISTRATION NUMBER: 33,875 

(C) REFERENCE /DOCKET NUMBER: 8600-0139 

Mxl TELECOMMUNICATION INFORMATION • 



(2) INFORMATION FOR SEQ ID NO : 1 : 
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(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

5 

(iii) HYPOTHETICAL: NO 

<iv) ANTI -SENSE: NO 

10 (vi) ORIGINAL SOURCE: 



15 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Lys Gly Asp Tyr Glu Lys He Leu Val Ala Leu Cys Gly Gly Asn 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

3 0 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Peptide, rl, Fig. 1C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2; 



(i) SEQUENCE CHARACTERISTICS: 
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(ii) MOLECULE TYPE : peptide 

{iii) HYPOTHETICAL: NO 

5 (iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Peptide rll, Fig. 1C 



10 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Phe Val Ser Asp Val Val He 
1 5 

(2) INFORMATION FOR SEQ ID NO:4: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
2 0 (B) TYPE: amino acid 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

25 (iii) HYPOTHETICAL: NO : 

(iv) ANTI- SENSE : NO 

(vi) ORIGINAL SOURCE: 
30 (C) INDIVIDUAL ISOLATE: Peptide rill, Fig. 1C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4; 

3 5 Asp Val Leu Ser Val Ala Phe 

1 5 

TMFOpMATTOY POP c;vo Tn MO ■ ^ ■ 



(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: peptide rIV, Fig. 1C 



10 



15 



20 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Val Ser Cys Val Arg Phe Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

{ D ) TOP OLOGY : unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

25 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Peptide rV, Fig. 1C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Gly Tyr Leu Asn Thr Val Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 7: 

( i ) SEQUENCE CHARACTERISTICS : 



(ii) MOLECULE TYPE: peptide 
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(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Peptide rVI , Fig. 1C 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Asp lie lie Asn Ala Leu Cys Phe 
10 1 5 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

20 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

25 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Peptide rVII, Fig. 1C 



30 



35 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 8 : 

Pro Gin Cys Thr Ser Leu Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino arid 



(iii) HYPOTHETICAL: 



NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: control peptide 1, homol. to RACK1 
261-266, LKGKIL 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Leu Lys Gly Lys lie Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

20 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 
2 5 (C) INDIVIDUAL ISOLATE: control peptide 2, iden. to RACK1 , 

265 to 270 IIVDEL 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 10 : 

lie He Val Asp Glu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: PKC substrate peptide, (Ser25) 
PKC(19-36) 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

Arg Phe Ala Arg Lys Gly Ser Leu Arg Gin Lys Asn Val His Glu Val 
15 10 15 

10 

T.VR 2\rti 



15 



20 



(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

25 (iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE : PKC Pseudosubstrate Inhibitor 
<PCK(19-36) ) 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg Phe Ala Arg Lys Gly Ala Leu Arg Gin Lys Asn Val His Glu Val 
35 1 5 10 15 

Lys Asn 



U SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

5 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBH Peptide, rl, Fig. 24 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Trp Val Thr Gin lie Ala Thr Thr Pro Gin Phe Pro Asp Met lie 

15 10 15 

15 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(C) INDIVIDUAL ISOLATE: GBH Peptide rll, Fig. 24 



25 



30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Phe Val Ser Asp Val Val lie Ser Ser Asp Gly Gin Phe Ala Leu 
35 1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 15: 



(D) TOPOLOGY: unknown 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBH Peptide rill, Fig. 24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Asp Val Leu Ser Val Ala Phe Ser Ser Asp Asn Arg Gin lie Val 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 16 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

20 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
25 (C) INDIVIDUAL ISOLATE: GBH Peptide rIV, Fig. 24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

3 0 Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser Ser Asn Pro lie 

15 10 15 

(2) INFORMATION FOR SEQ ID NO: 17: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 



in, HYPOTHETICAL: NO 
(iv) ANTI - SENSE : NO 
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(C) INDIVIDUAL ISOLATE: GBH Peptide rV, Fig. 24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

5 Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu Cys Ala 

15 10 15 

(2) INFORMATION FOR SEQ ID NO: 18: 

10 (i) SEQUENCE CHARACTERISTICS: 

( a ) LENGTH: IS amino acids 
(B) TYPE: amino acid 
{ D ) TOPOLOGY : unknown 

15 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 



20 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBH Peptide rVI , Fig. 24 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Thr Leu Asp Gly Gly Asp lie lie Asn Ala Leu Cys Phe Ser Pro 

15 10 15 

3 0 (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1115 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

M i > MOTFnHF TYPF ■ ^promin^ 



(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGCACGAGGG GTCGCGGTGG CAGCCGTGCG GTGCTTGGCT CCCTAAGCTA TCCGGTGCCA 
60 

5 

TCCTTGTCGC TGCGGCGACT CGCAACATCT GCAGCCATGA CCGAGCAAAT GACCCTTCGT 120 

GGGACCCTCA AGGGCCATAA TGGATGGGTT ACACAGATCG CCACCACTCC GCAGTTCCCG 180 

10 GACATGATCC TGTCGGCGTC TCGAGACAAG ACCATCATCA TGTGGAAGCT GACCAGGGAT 24 0 

GAGACCAACT ACGGCATACC ACAACGTGCT CTTCGAGGTC ACTCCCACTT TGTTAGCGAT 30 0 

GTTGTCATCT CCTCTGATGG CCAGTTTGCC CTCTCAGGCT CCTGGGATGG AACCCTACGC 36 0 

15 

CTCTGGGATC TCACAACGGG CACTACCACG AGACGATTTG TCGGCCACAC CAAGGATGTG 42 0 

CTGAGCGTGG CTTTCTCCTC TGACAACCGG CAGATTGTCT CTGGGTCCCG AGACAAGACC 480 

2 0 ATTAAGTTAT GGAATACTCT GGGTGTCTGC AAGTACACTG TCCAGGATGA GAGTCATTCA 540 

GAATGGGTGT CTTGTGTCCG CTTCTCCCCG AACAGCAGCA ACCCTATCAT CGTCTCCTGC 6 00 

GGATGGGACA AGCTGGTCAA GGTGTGGAAT CTGGCTAACT GCAAGCTAAA GACCAACCAC 660 

25 

ATTGGCCACA CTGGCTATCT GAACACAGTG ACTGTCTCTC CAGATGGATC CCTCTGTGCT 72 0 

TCTGGAGGCA AGGATGGCCA GGCTATGCTG TGGGATCTCA ATGAAGGCAA GCACCTTTAC 7 80 

3 0 ACATTAGATG GTG GAGA CAT CATCAATGCC TTGTGCTTCA GCCCCAACCG CTACTGGCTC 84 0 

TGTGCTGCCA CTGGCCCCAG TATCAAGATC TGGGACTTGG AGGGCAAGAT CATGGTAGAT 900 

GAACTGAAGC AAGAAGTTAT CAGCACCAGC AGCAAGGCAG AGCCACCCCA GTGTACCTCT 96 0 

35 

TTGGCTTGGT CTGCTGATGG CCAGACTCTG TTTGCTGGCT ATACCGACAA CTTGGTGCGT 102 0 

n T T , j\ r yrinr i j^n ; n. Tn^PTATT^^ TArf^C^A A A^^^TTi^^ . /"^t"^^ ^ * * ? * t> . 1 n o n 

(2) INFORMATION FOR SEQ ID NO: 20: 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 

5 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
10 (vi) ORIGINAL SOURCE: 



15 



35 



( r \ INDIVIDUAL ISOLATE: RACK1 r I DMA. Secjuen.ce, 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGCCATAATG GATGGGTTAC ACAGATCGCC ACCACTCCGC AGTTCCCGGA CATGATCCTG 
60 



TCGGCGTCTC GAGACAAGAC CATCATCATG TGGAAG 
20 96 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

3 0 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 rll DNA Sequence 



GGTCACTCCC ACTTTGTTAG CGATGTTGTC ATCTCCTCTG ATGGCCAGTT TGCCCTCTCA 
60 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 



15 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 rill DNA Sequence, Fig. 1A 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

20 

GGCCACACCA AGGATGTGCT GAGCGTGGCT TTCTCCTCTG ACAACCGGCA GATTGTCTCT 
60 

GGGTCCCGAG ACAAGACCAT TAAGTTATGG AAT 
25 93 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



(C) INDIVIDUAL ISOLATE: RACK1 rIV DNA Sequence, Fig. 1A 
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AGTCATTCAG AATGGGTGTC TTGTGTCCGC TTCTCCCCGA ACAGCAGCAA CCCTATCATC 
60 

GTCTCCTGCG GATGGGACAA GCTGGTCAAG GTGTGGAAT 
5 99 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDRDNFSS * double 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



20 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 rV DNA Sequence, Fig. 1A 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

GGCCACACTG GCTATCTGAA CACAGTGACT GTCTCTCCAG ATGGATCCCT CTGTGCTTCT 
60 

30 GGAGGCAAGG ATGGCCAGGC TATGCTGTGG GAT 

93 

(2) INFORMATION FOR SEQ ID NO: 25: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

<T)) TOPOLOGY • linp^ 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 rVI DNA Sequence, Fig. 1A 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TTAGATGGTG GAGACATCAT CAATGCCTTG TGCTTCAGCC CCAACCGCTA CTGGCTCTGT 
60 

10 GCTGCCACTG GCCCCAGTAT CAAGATCTGG GAC 
°2 

(2) INFORMATION FOR SEQ ID NO:26: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

25 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 rVII DNA Sequence, Fig. 1A 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

AGCAAGGCAG AGCCACCCCA GTGTACCTCT TTGGCTTGGT CTGCTGATGG CCAGACTCTG 
60 

TTTGCTGGCT ATACCGACAA CTTGGTGCGT GTATGGCAG 
99 



(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 Amino Acid Sequence, Fig. 1C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

10 

Met Thr Glu Gin Met Thr Leu Arg Gly Thr Leu Lys Gly His Asn Gly 
15 10 15 

Trp Val Thr Gin lie Ala Thr Thr Pro Gin Phe Pro Asp Met He Leu 
15 20 25 30 

Ser Ala Ser Arg Asp Lys Thr He He Met Trp Lys Leu Thr Arg Asp 
35 40 45 

2 0 Glu Thr Asn Tyr Gly He Pro Gin Arg Ala Leu Arg Gly His Ser His 

50 55 60 

Phe Val Ser Asp Val Val He Ser Ser Asp Gly Gin Phe Ala Leu Ser 
65 70 75 80 

25 

Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp Leu Thr Thr Gly Thr 

85 90 95 

Thr Thr Arg Arg Phe Val Gly His Thr Lys Asp Val Leu Ser Val Ala 

30 100 105 110 

Phe Ser Ser Asp Asn Arg Gin He Val Ser Gly Ser Arg Asp Lys Thr 
115 120 125 

35 He Lys Leu Trp Asn Thr Leu Gly Val Cys Lys Tyr Thr Val Gin Asp 

130 135 140 

Glu Ser His Ser Glu Trr> Val Spr fvs Val ^rn Php ^o>- p>^ cor- 



165 



170 



175 
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Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu Cys Ala 
195 200 205 

Ser Gly Gly Lys Asp Gly Gin Ala Met Leu Trp Asp Leu Asn Glu Gly 
5 210 215 220 

Lys His Leu Tyr Thr Leu Asp Gly Gly Asp He He Asn Ala Leu Cys 
225 230 235 240 

10 Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser He 

245 250 255 

Lys He Trp Asp Leu Glu Gly Lys He He Val Asp Glu Leu Lys Gin 
260 265 270 

15 

Glu Val He Ser Thr Ser Ser Lys Ala Glu Pro Pro Gin Cys Thr Ser 
275 280 285 

Leu Ala Trp Ser Ala Asp Gly Gin Thr Leu Phe Ala Gly Tyr Thr Asp 
20 290 295 300 

Asn Leu Val Arg Val Trp Gin Val Thr He Gly Thr Arg 
305 310 315 



25 



(2) INFORMATION FOR SEQ ID NO: 28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

35 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 
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10 



15 



Val Ala Lys Glu Thr Pro Asp Lys Val Glu Leu Ser Lys Glu Glu Val 
20 25 30 



Lys Arg Leu He Ala Glu Ala Lys Glu Lys Leu Gin Glu Glu Gly Gly 
35 40 45 



Gly Ser Asp Glu Glu Glu Thr Gly Ser Pro Ser Glu Asp Gly Met Gin 
10 50 55 60 

Ser Ala Arg Thr Gin Ala Arg Pro Arg Glu Pro Leu Glu Asp Gly Asp 
65 70 75 80 

15 Pro Glu Asp Asp Arg Thr Leu Asp Asp Asp Glu Leu Ala Glu Tyr Asp 

85 90 95 



20 



Leu Asp Lys Tyr Asp Glu Glu Gly Asp Pro Asp Ala Glu Thr Leu Gly 
100 105 110 

Glu Ser Leu Leu Gly lieu Thr Val Tyr Gly Ser Asn Asp Gin Asp Pro 
115 120 125 



Tyr Val Thr Leu Lys Asp Thr Glu Gin Tyr Glu Arg Glu Asp Phe Leu 

25 130 135 140 

He Lys Pro Ser Asp Asn Leu He Val Cys Gly Arg Ala Glu Gin Asp 

145 150 155 160 

3 0 Gin Cys Asn Leu Glu Val His Val Tyr Asn Gin Glu Glu Asp Ser Phe 

165 170 175 



35 



Tyr Val His His Asp He Leu Leu Ser Ala Tyr Pro Leu Ser Val Glu 
180 185 190 

Trp Leu Asn Phe Asp Pro Ser Pro Asp Asp Ser Thr Gly Asn Tyr He 
195 200 205 



Val Asp Ser Leu Glu Pro Val Phe Thr Leu Gly Ser Lys Leu Ser Lys 
225 230 235 240 
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Asp Ala Val Leu Asp Leu Ser Trp Asn Lys Leu lie Arg Asn Val Leu 
260 265 270 

Ala Ser Ala Ser Ala Asp Asn Thr Val He Leu Trp Asp Met Ser Leu 
5 275 280 285 

Gly Lys Pro Ala Ala Ser Leu Ala Val His Thr Asp Lys Val Gin Thr 

290 295 300 

10 Leu Gin Phe His Pro Phe Glu Ala Gin Thr Leu He Ser Gly Ser Tyr 



Asp Lys Ser Val Ala Leu Tyr Asp Cys Arg Ser Pro Asp Glu Ser His 
325 330 335 

15 

Arg Met Trp Arg Phe Ser Gly Gin He Glu Arg Val Thr Trp Asn His 
340 345 350 

Phe Ser Pro Cys His Phe Leu Ala Ser Thr Asp Asp Gly Phe Val Tyr 
20 355 360 365 

Asn Leu Asp Ala Arg Ser Asp Lys Pro He Phe Thr Leu Asn Ala His 
370 375 380 

2 5 Asn Asp Glu He Ser Gly Leu Asp Leu Ser Ser Gin He Lys Gly Cys 

385 390 395 400 

Leu Val Thr Ala Ser Ala Asp Lys Tyr Val Lys He Trp Asp He Leu 
405 410 415 

30 

Gly Asp Arg Pro Ser Leu Val His Ser Arg Asp Met Lys Met Gly Val 
420 425 430 

Leu Phe Cys Ser Ser Cys Cys Pro Asp Leu Pro Phe He Tyr Ala Phe 
35 435 440 445 

Gly Gly Gin Lys Glu Gly Leu Arg Val Trp Asp He Ser Thr Val Ser 

.« c n 1 r r * ^ ~ 



Ala Arg Asn Ser Ser He Ser Gly Pro Phe Gly Ser Arg Ser Ser Asp 
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500 

(2) INFORMATION FOR SEQ ID NO: 29: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 428 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



15 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: AAC-RICH protein, Fig. 12 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Pro Gly Gly Phe Gin His Leu Gin Gin Gin Gin Gin Gin Gin Gin Gin 

15 10 15 

25 Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Thr Gin Val Gin 

20 25 30 

Gin Leu His Asn Gin Leu His Gin Gin His Asn Gin Gin lie Gin Gin 
35 40 45 

30 

Gin Ala Gin Ala Thr Gin Gin His Leu Gin Thr Gin Gin Tyr Leu Gin 
50 55 60 

Ser Gin lie His Gin Gin Ser Gin Gin Ser Gin Leu Ser Asn Asn Leu 
35 65 70 75 80 

Asn Ser Asn Ser Lys Glu Ser Thr Asn lie Pro Lys Thr Asn Thr Gin 



Ser Glu Cys Ser Thr Lys Asp Phe He Gly Asn Lys Lys Lys Ser Thr 
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130 

Asp Gly lie Val 
145 

5 

Asn Asn Asn Asn 

lie Lys Glu Thr 
10 180 

lie Ser Trp Ser 
195 



- 79 - 
135 

Arg Val Trp Asn 
150 

Ser Asn Asn Thr 

165 

lie Glu Leu Lys 

Pro Lys Asn Asn 

200 



140 

Phe Asp Pro Leu 
155 

Ser Ser Asn Ser 

170 

Gly His Asp Gly 
185 

Asp Leu Leu Ala 



Gly Asn Ser Asn 
160 

Lys Asn Asn Asn 
175 

Ser lie Glu Lys 
190 

Ser Ala Gly Thr 
205 



15 Asp Lys Val lie Lys lie Trp Asp Val Lys lie Gly Lys Cys lie Gly 

210 215 220 

Thr Val Ser Thr Asn Ser Glu Asn lie Asp Val Arg Trp Ser Pro Asp 
225 230 235 240 

20 

Gly Asp His Leu Ala Leu He Asp Leu Pro Thr He Lys Thr Leu Lys 
245 250 255 

He Tyr Lys Phe Asn Gly Glu Glu Leu Asn Gin Val Gly Trp Asp Asn 
25 260 265 270 

Asn Gly Asp Leu He Leu Met Ala Asn Ser Met Gly Asn He Glu Ala 
275 280 285 

3 0 Tyr Lys Phe Leu Pro Lys Ser Thr Thr His Val Lys His Leu Lys Thr 

290 295 300 

Leu Tyr Gly His Thr Ala Ser He Tyr Cys Met Glu Phe Asp Pro Thr 
305 310 315 320 



35 



Gly Lys Tyr Leu Ala Ala Gly Ser Ala Asp Ser He Val Ser Leu Trp 
325 330 335 



Pro Cys Arg Ser Val Ser Phe Ser Phe Asp Gly Gin Phe lie Ala Ala 
355 360 365 
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Pro He His Thr He Glu Cys Gly Val " Ser Ser Leu Met Trp His Pro 
385 390 395 400 

Thr Leu Pro Leu Leu Ala Tyr Ala Pro Glu Ser He Asn Glu Asn Asn 
5 405 410 415 

Lys Asp Pro Ser He Arg Val Phe Gly Tyr His Ser 
420 425 

10 (2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 517 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

20 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: BETA TRCP, Fig. 13 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Glu Gly Phe Ser Cys Ser Leu Gin Pro Pro Thr Ala Ser Glu Arg 
30 1 5 10 15 

Glu Asp Cys Asn Arg Asp Glu Pro Pro Arg Lys He He Thr Glu Lys 
20 25 30 

3 5 Asn Thr Leu Arg Gin Thr Lys Leu Ala Asn Gly Thr Ser Ser Met He 

35 40 45 



65 



75 



80 
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lie Asn Thr Tyr Leu Lys Pro Met Leu Gin Arg Asp Phe lie Thr Ala 
100 105 110 

Leu Pro Ala Arg Gly Leu Asp His lie Ala Glu Asn lie Leu Ser Tyr 
5 115 120 125 

Leu Asp Ala Lys Ser Leu Cys Ser Ala Glu Leu Val Cys Lys Glu Trp 
130 135 140 

10 Tyr Arg Val Thr Ser Asp Gly Met Leu Trp Lys Lys Leu lie Glu Arg 

145 150 155 160 

Met Val Arg Thr Asp Ser Leu Trp Arg Gly Leu Ala Glu Arg Arg Gly 
165 170 175 

15 

Trp Gly Gin Tyr Leu Phe Lys Asn Lys Pro Pro Asp Gly Lys Thr Pro 
180 185 190 

Pro Asn Ser Phe Tyr Arg Ala Leu Tyr Pro Lys lie lie Gin Asp lie 
20 195 200 205 

Glu Thr lie Glu Ser Asn Trp Arg Cys Gly Arg His Ser Leu Gin Arg 
210 215 220 

2 5 lie His Cys Arg Ser Glu Thr Ser Lys Gly Val Tyr Cys Leu Gin Tyr 

225 230 235 240 

Asp Asp Gin Lys lie Val Ser Gly Leu Arg Asp Asn Thr lie Lys lie 
245 250 255 

30 

Trp Asp Lys Asn Thr Leu Glu Cys Lys Arg Val Leu Met Gly His Thr 
260 265 270 

Gly Ser Val Leu Cys Leu Gin Tyr Asp Glu Arg Val lie lie Thr Gly 
35 275 280 285 

Ser Asp Ser Thr Val Arg Val Trp Asp Val Asn Thr Gly Glu Met Leu 

290 295 300 



Asn Gly Met Met Val Thr Cys Ser Lys Asp Arg Ser He Ala Val Trp 

7 2 t; 
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340 345 350 

His Arg Ala Ala Val Asn Val Val Asp Phe Asp Asp Lys Tyr lie Val 
355 360 365 

5 

Ser Ala Ser Gly Asp Arg Thr He Lys Val Trp Asn Thr Ser Thr Cys 
370 375 380 

Glu Phe Val Arg Thr Leu Asn Gly His Lys Arg Gly He Ala Cys Leu 
10 385 390 395 400 

Gin Tyr Arg Asp Arg Leu Val Val Ser Gly Ser Ser Asp Asn Thr He 
405 410 415 

15 Arg Leu Trp Asp He Glu Cys Gly Ala Cys Leu Arg Val Leu Glu Gly 

420 425 430 

His Glu Glu Leu Val Arg Cys He Arg Phe Asp Asn Lys Arg He Val 
435 440 445 

20 

Ser Gly Ala Tyr Asp Gly Lys He Lys Val Trp Asp Leu Val Ala Ala 
450 455 460 

Leu Asp Pro Arg Ala Pro Ala Gly Thr Leu Cys Leu Arg Thr Leu Val 
25 465 470 475 480 

Glu His Ser Gly Arg Val Phe Arg Leu Gin Phe Asp Glu Phe Gin He 
485 490 495 

30 Val Ser Ser Ser His Asp Asp Thr He Leu He Trp Asp Phe Leu Asn 

500 505 510 

Asp Pro Gly Leu Ala 
515 



35 



(2) INFORMATION FOR SEQ ID NO: 31: 



,L; TOPOLOGY : uriKnowi. 
di) MOLECULE TYPE: protein 



WO 95/21252 



PCT/US95/01210 



- 83 - 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: beta-prime- cop, Fig. 14 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Met Pro Leu Arg Leu Asp lie Lys Arg Lys Leu Thr Ala Arg Ser Asp 
10 1 5 10 15 

Arg Val Lys Ser Val Asp Leu His Pro Thr Glu Pro Trp Met Leu Ala 
20 25 30 

15 Ser Leu Tyr Asn Gly Ser Val Cys Val Trp Asn His Glu Thr Gin Thr 

35 40 45 

Leu Val Lys Thr Phe Glu Val Cys Asp Leu Pro Val Arg Ala Ala Lys 
50 55 60 

20 

Phe Val Ala Arg Lys Asn Trp Val Val Thr Gly Ala Asp Asp Met Gin 
65 70 75 80 

lie Arg Val Phe Asn Tyr Asn Thr Leu Glu Arg Val His Met Phe Glu 
25 85 90 95 

Ala His Ser Asp Tyr lie Arg Cys lie Ala Val His Pro Thr Gin Pro 
100 105 110 

3 0 Phe lie Leu Thr Ser Ser Asp Asp Met Leu He Lys Leu Trp Asp Trp 

115 120 125 

Asp Lys Lys Trp Ser Cys Ser Gin Val Phe Glu Gly His Thr His Tyr 
130 135 140 



35 



Val Met Gin He Val He Asn Pro Lys Asp Asn Asn Gin Phe Ala Ser 
145 150 155 160 



Pro Asn Phe Thr Leu Glu Gly His Glu Lys Gly Val Asn Cys He Asp 
180 185 190 
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Arg Leu Val Lys He Trp Asp Tyr Gin Asn Lys Thr Cys Val Gin Thr 
210 215 220 

Leu Glu Gly His Ala Gin Asn Val Ser Cys Ala Ser Phe His Pro Glu 
5 225 230 235 240 

Leu Pro He He He Thr Gly Ser Glu Asp Gly Thr Val Arg He Trp 

245 250 255 

10 His Ser Ser Thr Tyr Arg Leu Glu Ser Thr Leu Asn Tyr Gly Met Glu 

260 265 270 

Arg Val Trp Cys Val Ala Ser Leu Arg Gly Ser Asn Asn Val Ala Leu 
275 280 285 

15 

Gly Tyr Asp Glu Gly Ser He He Val Lys Leu Gly Arg Glu Glu Pro 
290 295 300 

Ala Met Ser Met Asp Ala Asn Gly Lys lie He Trp Ala Lys His Ser 
20 305 310 315 320 

Glu Val Gin Gin Ala Asn Leu Lys Ala Met Gly Asp Ala Glu He Lys 
325 330 335 

2 5 Asp Gly Glu Arg Leu Pro Leu Ala Val Lys Asp Met Gly Ser Cys Glu 

340 345 350 

He Tyr Pro Gin Thr He Gin His Asn Pro Asn Gly Arg Phe Val Val 
355 360 365 

30 

Val Cys Gly Asp Gly Glu Tyr He He Tyr Thr Ala Met Ala Leu Arg 
370 375 380 

Asn Lys Ser Phe Gly Ser Ala Gin Glu Phe Ala Trp Ala His Asp Ser 
35 385 390 395 400 

Ser Glu Tyr Ala He Arg Glu Ser Asn Ser Val Val Lys He Phe Lys 

ijnc 4 i n 4 7 ^ 



He Tyr Gly Gly Phe Leu Leu Gly Val Arg Ser Val Asn Gly Leu Ala 

< "> r - \ \ ~ \ \ z 
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450 455 460 

Pro Lys His lie Phe Trp Ser Asp Ser Gly Glu Leu Val Cys lie Ala 
465 470 475 480 

5 

Thr Glu Glu Ser Phe Phe He Leu Lys Tyr Leu Ser Glu Lys Val Leu 
485 490 495 

Ala Ala Gin Glu Thr His Glu Gly Val Thr Glu Asp Gly He Glu Asp 
10 500 505 510 

Gly Phe Glu Val Leu Gly Glu He Gin Glu He Val Lys Thr Gly Leu 
515 520 525 

15 Trp Val Gly Asp Cys Phe He Tyr Thr Ser Ser Val Asn Arg Leu Asn 

530 535 540 

Tyr Tyr Val Gly Gly Glu lie Val Thr He Ala His Leu Asp Arg Thr 

545 550 555 560 

20 

Met Tyr Leu Leu Gly Tyr lie Pro Lys Asp Asn Arg Leu Tyr Leu Gly 
565 570 575 

Asp Lys Glu Leu Asn He Val Ser Tyr Ser Leu Leu Val Ser Val Leu 
25 580 585 590 

Glu Tyr Gin Thr Ala Val Met Arg Arg Asp Phe Ser Met Ala Asp Lys 

595 600 605 

30 Val Leu Pro Thr He Pro Lys Glu Gin Arg Thr Arg Val Ala His Phe 

610 615 620 

Leu Glu Lys Gin Gly Phe Lys Gin Gin Ala Leu Thr Val Ser Thr Asp 
625 630 635 640 

35 

Pro Glu His Arg Phe Glu Leu Ala Leu Gin Leu Gly Glu Leu Lys He 
645 650 655 



w eu Ala Giu Leu Ala lie Ser Lys Cys Pro Phe Gly Leu Ala Gin Glu 
675 680 685 
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Ala Ser Gly Asn Ala Ser Met Val Asn Lys Leu Ala Glu Gly Ala Glu. 
705 710 715 720 

Arg Asp Gly Lys Asn Asn Val Ala Phe Met Ser Tyr Phe Leu Gin Gly 
725 730 735 

Lys Leu Asp Ala Cys Leu Glu Leu Leu lie Arg Thr Gly Arg Leu Pro 
740 745 750 



10 



Glu Ala Ala Phe Leu Ala Arg Thr Tyr Leu Pro Ser Gin Val Ser Arg 



15 



Val Val Lys Leu Trp Arg Glu Asn Leu Ser Lys Val Asn Gin Lys Ala 
770 775 780 

Ala Glu Ser Leu Ala Asp Pro Thr Glu Tyr Glu Asn Leu Phe Pro Gly 
785 790 795 800 



20 



Leu Lys Glu Ala Phe Val Val Glu Glu Trp Val Lys Glu Thr His Ala 
805 810 815 



Asp Leu Trp Pro Ala Lys Gin Tyr Pro Leu Val Thr Pro Asn Glu Glu 
820 825 830 



25 



Arg Asn Val Met Glu Glu Ala Lys Gly Phe Gin Pro Ser Arg Ser Ala 
835 840 845 



30 



Ala Gin Gin Glu Leu Asp Gly Lys Pro Ala Ser Pro Thr Pro Val lie 
850 855 860 

Val Thr Ser Gin Thr Ala Asn Lys Glu Glu Lys Ser Leu Leu Glu Leu 
865 870 875 880 



35 



Glu Val Asp Leu Asp Asn Leu Glu lie Glu Asp lie Asp Thr Thr Asp 
885 890 895 



lie Asn Leu Asp Glu Asp lie Leu Asp Asp 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 779 amino acids 
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(ii) MOLECULE TYPE; protein 

(iii) HYPOTHETICAL: NO 

5 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein, Fig. 15 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Ser Phe Pro Leu Ala Glu Phe Pro Leu Arg Asp lie Pro Val 
15 10 15 

15 

Pro Tyr Ser Tyr Arg Val Ser Gly Gly lie Ala Ser Ser Gly Ser Val 
20 25 30 

Thr Ala Leu Val Thr Ala Ala Gly Thr His Arg Asn Ser Ser Thr Ala 
20 35 40 45 

Lys Thr Val Glu Thr Glu Asp Gly Glu Glu Asp lie Asp Glu Tyr Gin 
50 55 60 

25 Arg Lys Arg Ala Ala Gly Ser Gly Glu Ser Thr Pro Glu Arg Ser Asp 

65 70 75 80 

Phe Lys Arg Val Lys His Asp Asn His Lys Thr Leu His Pro Val Asn 

85 90 95 

30 

Leu Gin Asn Thr Gly Ala Ala Ser Val Asp Asn Asp Gly Leu His Asn 
100 105 110 

Leu Thr Asp lie Ser Asn Asp Ala Glu Lys Leu Leu Met Ser Val Asp 
35 115 120 125 

Asp Gly Ser Ala Ala Pro Ser Thr Leu Ser Val Asn Met Gly Val Ala 
130 135 140 



Gly Ser Asp Val Ser Asn Asn Val Asn Ser Ala Thr lie Asn Asn Pro 
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180 185 190 

Thr Thr Thr Pro Leu Ala Lys Thr Thr Lys Thr lie Asn Asn Asn Asn 
195 200 205 

Asn lie Ala Asp Leu He Glu Ser Lys Asp Ser He He Ser Pro Glu 
210 215 220 

Tyr Leu Ser Asp Glu He Phe Ser Ala He Asn Asn Asn Leu Pro His 
10 225 230 235 240 

Ala Tyr Phe Lys Asn Leu Leu Phe Arg Leu Val Ala Asn Met Asp Arg 
245 250 255 

15 Ser Glu Leu Ser Asp Leu Gly Thr Leu He Lys Asp Asn Leu Lys Arg 

260 265 270 

Asp Leu He Thr Ser Leu Pro Phe Glu He Ser Leu Lys He Phe Asn 
275 280 285 

20 

Tyr Leu Gin Phe Glu Asp He He Asn Ser Leu Gly Val Ser Gin Asn 
290 295 300 

Trp Asn Lys He He Arg Lys Ser Thr Ser Leu Trp Lys Lys Leu Leu 
25 305 310 315 320 

He Ser Glu Asn Phe Val Ser Pro Lys Gly Phe Asn Ser Leu Asn Leu 
325 330 335 

3 0 Lys Leu Ser Gin Lys Tyr Pro Lys Leu Ser Gin Gin Asp Arg Leu Arg 

340 345 350 

Leu Ser Phe Leu Glu Asn He Phe He Leu Lys Asn Trp Tyr Asn Pro 
355 360 365 



35 



Lys Phe Val Pro Gin Arg Thr Thr Leu Arg Gly His Met Thr Ser Val 
370 375 380 



Asp Lys Met He Arg Val Tyr Asp Ser He Asn Lys Lys Phe Leu Leu 
405 410 415 



WO 95/21252 



PCT/US95/01210 



- 89 - 

Gly Gly He Leu Val Ser Gly Ser Thr Asp Arg Thr Val Arg Val Trp 
435 440 445 

Asp He Lys Lys Gly Cys Cys Thr His Val Phe Glu Gly His Asn Ser 
5 450 455 460 

Thr Val Arg Cys Leu Asp He Val Glu Tyr Lys Asn He Lys Tyr He 
465 470 475 480 



10 Val Thr Gly Ser Arg Asp Asn Thr Leu His Val Trp Lys Leu Pro Lys 

485 490 495 

Glu Ser Ser Val Pro Asp His Gly Glu Glu His Asp Tyr Pro Leu Val 
500 505 510 

15 

Phe His Thr Pro Glu Glu Asn Pro Tyr Phe Val Gly Val Leu Arg Gly 
515 520 525 

His Met Ala Ser Val Arg Thr Val Ser Gly His Gly Asn He Val Val 
20 530 535 540 

Ser Gly Ser Tyr Asp Asn Thr Leu He Val Trp Asp Val Ala Gin Met 
545 550 555 560 

25 Lys Cys Leu Tyr He Leu Ser Gly His Thr Asp Arg He Tyr Ser Thr 

565 570 575 

He Tyr Asp His Glu Arg Lys Arg Cys He Ser Ala Ser Met Asp Thr 
580 585 590 

30 

Thr He Arg lie Trp Asp Leu Glu Asn He Trp Asn Asn Gly Glu Cys 
595 600 605 

Ser Tyr Ala Thr Asn Ser Ala Ser Pro Cys Ala Lys He Leu Gly Ala 
35 610 615 620 

Met Tyr Thr Leu Gin Gly His Thr Ala Leu Val Gly Leu Leu Arg Leu 

625 630 f;^ c A n 



rrp Asp Ala Asn Asp Tyr Ser Arg Lys Phe Ser Tyr His His Thr Asn 

660 £ c; r, C~>n 
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675 680 685 

Gly Ser Glu Asn Gin Phe Asn lie Tyr Asn Leu Arg Ser Gly Lys Leu 
690 695 700 

5 

Val His Ala Asn lie Leu Lys Asp Ala Asp Gin lie Trp Ser Val Asn 
705 710 715 720 

Phe Lys Gly Lys Thr Leu Val Ala Ala Val Glu Lys Asp Gly Gin Ser 
10 725 730 735 

Phe Leu Glu He Leu Asp Phe Ser Lys Ala Ser Lys He Asn Tyr Val 
740 745 750 

15 Ser Asn Pro Val Asn Ser Ser Ser Ser Ser Leu Glu Ser He Ser Thr 

755 760 765 

Ser Leu Gly Leu Thr Arg Thr Thr He He Pro 
770 775 



20 



(2) INFORMATION FOR SEQ ID NO; 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : protein 

3 0 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (C) INDIVIDUAL ISOLATE: GBLP - CHLAM I DOMONAS HOMOLOG, Fig. 16 

fxi) SEQUENCE DESCRIPTION • SEO TO NO • "n ■ 



Trp Val Thr Ala He Ala Thr Pro Leu Asp Pro Ser Ser Asn Thr Leu 
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35 40 45 

Ser Glu Ser Asn Tyr Gly Tyr Ala Arg Lys Ala Leu Arg Gly His Ser 

50 55 60 

5 

His Phe Val Gin Asp Val Val He Ser Ser Asp Gly Gin Phe Cys Leu 
65 70 75 80 

Thr Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp Leu Asn Thr Gly 
10 85 90 95 

Thr Thr Thr Arg Arg Phe Val Gly His Thr Lys Asp Val Leu Ser Val 
100 105 110 

15 Ala Phe Ser Val Asp Asn Arg Gin He Val Ser Gly Ser Arg Asp Lys 

115 120 125 

Thr He Lys Leu Trp Asn Thr Leu Gly Glu Cys Lys Tyr Thr He Gly 
130 135 140 

20 

Glu Pro Glu Gly His Thr Glu Trp Val Ser Cys Val Arg Phe Ser Pro 
145 150 155 160 

Met Thr Thr Asn Pro He He Val Ser Gly Gly Trp Asp Lys Met Val 
25 165 170 175 

Lys Val Trp Asn Leu Thr Asn Cys Lys Leu Lys Asn Asn Leu Val Gly 
180 185 190 

30 His His Gly Tyr Val Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu 

195 200 205 

Cys Ala Ser Gly Gly Lys Asp Gly He Ala Met Leu Trp Asp Leu Ala 
210 215 220 



35 



Glu Gly Lys Arg Leu Tyr Ser Leu Asp Ala Gly Asp Val He His Cys 
225 230 235 240 



Ser lie Lys He Trp Asp Leu Glu Ser Lys Ser He Val Asp Asp Leu 
260 265 270 
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Val Ser Leu Ala Trp Ser Ala Asp Gly Ser Thr Leu Tyr Ser Gly Tyr 
290 295 300 

Thr Asp Gly Gin lie Arg Val Trp Ala Val Gly His Ser Leu 
5 305 310 315 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 658 amino acids 

'B^ TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : protein 

15 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

20 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: cop-1 protein, Fig. 17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

25 

Met Glu Glu lie Ser Thr Asp Pro Val Val Pro Ala Val Lys Pro Asp 
15 10 15 

Pro Arg Thr Ser Ser Val Gly Glu Gly Ala Asn Arg His Glu Asn Asp 
30 20 25 30 

Asp Gly Gly Ser Gly Gly Ser Glu He Gly Ala Pro Asp Leu Asp Lys 
35 40 45 

35 Asp Leu Leu Cys Pro He Cys Met Gin He He Lys Asp Ala Phe Leu 

50 55 60 



85 90 95 
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Ala Arg His Val Ser Lys Thr Ala Ser Pro Leu Asp Gin Phe Arg Glu 
115 120 125 

Ala Leu Gin Arg Gly Cys Asp Val Ser lie Lys Glu Val Asp Asn Leu 
5 130 135 140 

Leu Thr Leu Leu Ala Glu Arg Lys Arg Lys Met Glu Gin Glu Glu Ala 
145 150 155 160 

10 Glu Arg Asn Met Gin He Leu Leu Asp Phe Leu His Cys Leu Arg Lys 

165 170 175 

Gin Lys Val Asp Glu Leu Asn Glu Val Gin Thr Asp Leu Gin Tyr He 
180 185 190 

15 

Lys Glu Asp He Asn Ala Val Glu Arg His Arg He Asp Leu Tyr Arg 
195 200 205 

Ala Arg Asp Arg Tyr Ser Val Lys Leu Arg Met Leu Gly Asp Asp Pro 
20 210 215 220 

Ser Thr Arg Asn Ala Trp Pro His Glu Lys Asn Gin He Gly Phe Asn 
225 230 235 240 

25 Ser Asn Ser Leu Ser He Arg Gly Gly Asn Phe Val Gly Asn Tyr Gin 

245 250 255 

Asn Lys Lys Val Glu Gly Lys Ala Gin Gly Ser Ser His Gly Leu Pro 
260 265 270 

30 

Lys Lys Asp Ala Leu Ser Gly Ser Asp Ser Gin Ser Leu Asn Gin Ser 
275 280 285 

Thr Val Ser Met Ala Arg Lys Lys Arg He His Ala Gin Phe Asn Asp 
35 290 295 300 

Leu Gin Glu Cys Tyr Leu Gin Lys Arg Arg Gin Leu Ala Asp Gin Pro 
305 310 315 320 



Ser Asn Gly Leu Ala Asp Phe Gin Ser Val Leu Thr Thr Phe Thr Arg 
340 345 350 
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355 360 365 

His Ser Ala Asn lie Val Ser Ser lie Glu Phe Asp Arg Asp Asp Glu 
370 375 380 

5 

Leu Phe Ala Thr Ala Gly Val Ser Arg Cys He Lys Val Phe Asp Phe 
385 390 395 400 

Ser Ser Val Val Asn Glu Pro Ala Asp Met Gin Cys Pro He Val Glu 
10 405 410 415 

Met Ser Thr Arg Ser Lys Leu Ser Cys Leu Ser Trp Asn Lys His Glu 
420 425 430 

15 Lys Asn His He Ala Ser Ser Asp Tyr Glu Gly He Val Thr Val Trp 

435 440 445 

Asp Val Thr Thr Arg Gin Ser Leu Met Glu Thr Glu Glu Asn Glu Lys 
450 455 460 

20 

Arg Ala Trp Ser Val Asp Phe Ser Arg Thr Glu Pro Ser Met Leu Val 
465 470 475 480 

Ser Gly Ser Asp Asp Cys Lys Val Lys Val Trp Cys Thr Arg Gin Glu 
25 485 490 495 

Ala Ser Val lie Asn He Asp Met Lys Ala Asn He Cys Cys Val Lys 
500 505 510 

30 Tyr Asn Pro Gly Ser Ser Asn Tyr He Ala Val Gly Ser Ala Asp His 

515 520 525 

His lie His Tyr Tyr Asp Leu Arg Asn lie Ser Gin Pro Leu His Val 
530 535 540 



35 



Phe Ser Gly His Lys Lys Ala Val Ser Tyr Met Lys Phe Leu Ser Asn 
545 550 555 560 



;a. Lys Asp Asn Leu Pro Val Arg Thr Phe Arg Gly His Thr Asn Glu 
580 585 590 
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Ser Glu Thr Thr Arg Tyr Val Tyr His Lys Glu lie Thr Arg Pro Val 
610 615 620 

Thr Ser His Arg Phe Gly Ser Pro Asp Met Asp Asp Ala Glu Lys Arg 
5 625 630 635 640 

Gin Val Pro Thr Leu Leu Val Arg Phe Ala Gly Arg Val lie Val Pro 
645 650 655 

10 Arg Cys 

(2) INFORMATION FOR SEQ ID NO: 35: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



25 



(vi) ORIGINAL SOURCE: . 

(C) INDIVIDUAL ISOLATE: CORO PROTEIN, Fig. 18 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Met Ser Lys Val Val Arg Ser Ser Lys Tyr Arg His Val Phe Ala Ala 
15 10 15 

3 5 Gin Pro Lys Lys Glu Glu Cys Tyr Gin Asn Leu Lys Thr Lys Ser Ala 

20 25 30 

Val Trp Asp Ser Asn Tyr Val Ala Ala Asn Thr Ara Tvr Tio a^r 



5 0 5 5 6 0 

Lys Thr Thr Ser Val Pro T.pu Ph<- A^n ^ \' T^n T r 1 > 
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Asp He Ala Phe His Pro Phe Asn Glu Asn Leu Val Gly Ser Val Ser 
85 90 95 

Glu Asp Cys Asn He Cys He Trp Gly He Pro Glu Gly Gly Leu Thr 
5 100 105 110 

Asp Ser He Ser Thr Pro Leu Gin Thr Leu Ser Gly His Lys Arg Lys 
115 120 125 

10 Val Gly Thr He Ser Phe Gly Pro Val Ala Asp Asn Val Ala Val Thr 

130 140 

Ser Ser Gly Asp Phe Leu Val Lys Thr Trp Asp Val Glu Gin Gly Lys 
145 150 155 160 

15 

Asn Leu Thr Thr Val Glu Gly His Ser Asp Met He Thr Ser Cys Glu 
165 170 175 

His Asn Gly Ser Gin He Val Thr Thr Cys Lys Asp Lys Lys Ala Arg 
20 180 185 190 

Val Phe Asp Pro Arg Thr Asn Ser He Val Asn Glu Val Val Cys His 
195 200 205 

25 Gin Gly Val Lys Asn Ser Arg Ala He Phe Ala Lys Asp Lys Val He 

210 215 220 

Thr Val Gly Phe Ser Lys Thr Ser Glu Arg Glu Leu His He Tyr Asp 
225 230 235 240 

30 

Pro Arg Ala Phe Thr Thr Pro Leu Ser Ala Gin Val Val Asp Ser Ala 
245 250 255 

Ser Gly Leu Leu Met Pro Phe Tyr Asp Ala Asp Asn Ser He Leu Tyr 
35 260 265 270 

Leu Ala Gly Lys Gly Asp Gly Asn He Arg Tyr Tyr Glu Leu Val Asp 



Gin Arg Gly Leu Cys Phe Leu Pro Lys Arg Cys Leu Asn Thr Ser Glu 



WO 95/21252 



PCI7US95/01210 



5 



- 97 - 

325 '330 335 

He Ser Phe Arg Val Pro Arg Lys Ser Asp He Phe Gin Gly Asp He 
340 345 350 

Tyr Pro Asp Thr Tyr Ala Gly Glu Pro Ser Leu Thr Ala Glu Gin Trp 
355 360 365 

Val Ser Gly Thr Asn Ala Glu Pro Lys Thr Val Ser Leu Ala Gly- Gly 
10 370 375 380 

Phe Val Lys Lys Ala Ser Ala Val Glu Phe Lys Pro Val Val Gin Val 
385 390 395 400 

15 Gin Glu Gly Pro Lys Asn Glu Lys Glu Leu Arg Glu Glu Tyr Glu Lys 

405 410 415 

Leu Lys He Arg Val Ala Tyr Leu Glu Ser Glu He Val Lys Lys Asp 
420 425 430 



20 



25 



30 



Ala Lys He Lys Glu Leu Thr Asn 
435 440 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 445 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

fC) INDIVIDUAL ISOLATE : Coronin fpS5) , Fiq. 19 



Met Ser Lys Val Val Arg Ser Ser Lys Tyr Arg His Val Phe Ala Ala 

S ^ 1 5 
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20 25 30 

Ala Trp Asp Ser Asn Tyr Val Ala Ala Asn Thr Arg Tyr Phe Gly Val 
35 40 45 

5 

He Trp Asp Ala Ala Gly Gly Gly Ser Phe Ala Val He Pro His Glu 
50 55 60 

Ala Ser Gly Lys Thr Thr Ser Val Pro Leu Phe Asn Gly His Lys Ser 
10 65 70 75 80 

Ala Val Leu Asp He Ala Phe His Pro Phe Asn Glu Asn Leu Val Gly 
85 90 95 

15 Ser Val Ser Glu Asp Cys Asn He Cys He Trp Gly He Pro Glu Gly 

100 105 110 

Gly Leu Thr Asp Ser He Ser Thr Pro Leu Gin Thr Leu Ser Gly His 
115 120 125 

20 

Lys Arg Lys Val Gly Thr He Ser Phe Gly Pro Val Ala Asp Asn Val 
130 135 140 

Ala Val Thr Ser Ser Gly Asp Phe Leu Val Lys Thr Trp Asp Val Glu 
25 145 150 155 160 

Gin Gly Lys Asn Leu Thr Thr Val Glu Gly His Ser Asp Met He Thr 
165 170 175 

30 Ser Cys Glu Trp Asn His Asn Gly Ser Gin He Val Thr Thr Cys Lys 

180 185 190 

Asp Lys Lys Ala Arg Val Phe Asp Pro Arg Thr Asn Ser He Val Asn 
195 200 205 



35 



Glu Val Val Cys His Gin Gly Val Lys Asn Ser Arg Ala He Phe Ala 
210 215 220 



L eu His lie Tyr Asp Pro Arg A^a i j :\e Thi Thr tro ^eu Sei Ala Gi:i 
245 250 255 
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Asn Ser He Leu Tyr Leu Ala Gly Lys Gly Asp Gly Asn He Arg Tyr 
275 280 285 

Tyr Glu Leu Val Asp Glu Ser Pro Tyr He His Phe Leu Ser Glu Phe 
5 290 295 300 

Lys Ser Ala Thr Pro Gin Arg Gly Leu Cys Phe Leu Pro Lys Arg Cys 
305 310 315 320 

10 Leu Asn Thr Ser Glu Cys Glu He Ala Arg Gly Leu Lys Val Thr Pro 



Phe Thr Val Glu Pro lie Ser Phe Arg Val Pro Arg Lys Ser Asp He 
340 345 350 



15 



Phe Gin Gly Asp He Tyr Pro Asp Thr Tyr Ala Gly Glu Pro Ser Leu 
355 360 365 

Thr Ala Glu Gin Trp Val Ser Gly Thr Asn Ala Glu Pro Lys Thr Val 
20 370 375 380 

Ser Leu Ala Gly Gly Phe Val Lys Lys Ala Ser Ala Val Glu Phe Lys 
385 390 395 400 

25 Pro Val Val Gin Val Gin Glu Gly Pro Lys Asn Glu Lys Glu Leu Arg 

405 410 415 

Glu Glu Tyr Glu Lys Leu Lys He Arg Val Ala Tyr Leu Glu Ser Glu 
420 425 430 

30 

He Val Lys Lys Asp Ala Lys He Lys Glu Leu Thr Asn 
435 440 445 

(2) INFORMATION FOR SEQ ID NO: 37: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 
W TYPF ■ amirtn arid 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CSTF 50kDa, Fig. 20 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Met Tyr Arg Thr Lys Val Gly Leu Lys Asp Arg Gin Gin Leu Tyr Lys 
15 10 15 

10 Leu lie He Ser Gin Leu Leu Tyr Asp Gly Tyr He Ser He Ala Asn 

20 25 30 

Gly Leu He Asn Glu He Lys Pro Gin Ser Val Cys Ala Pro Ser Glu 
35 40 45 

15 

Gin Leu Leu His Leu He Lys Leu Gly Met Glu Asn Asp Asp Thr Ala 
50 55 60 

Val Gin Tyr Ala He Gly Arg Ser Asp Thr Val Ala Pro Gly Thr Gly 
20 65 70 75 80 

He Asp Leu Glu Phe Asp Ala Asp Val Gin Thr Met Ser Pro Glu Ala 
85 90 95 

25 Ser Glu Tyr Glu Thr Cys Tyr Val Thr Ser His Lys Gly Pro Cys Arg 

100 105 110 

Val Ala Thr Tyr Ser Arg Asp Gly Gin Leu He Ala Thr Gly Ser Ala 

115 120 125 

30 

Asp Ala Ser He Lys He Leu Asp Thr Glu Arg Met Leu Ala Lys Ser 
130 135 140 

Ala Met Pro He Glu Val Met Met Asn Glu Thr Ala Gin Gin Asn Met 
35 145 150 155 160 

Glu Asn His Pro Val He Arg Thr Leu Tyr Asp His Val Asp Glu Val 
165 170 175 



Arg Asp Tyr Thr Leu Lys Leu Phe Asp Tyr Ser Lys Pro Ser Ala Lys 

195 200 205 
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210 215 220 

Phe His Pro Ser Gly Asp Phe lie Leu Val Gly Thr Gin His Pro Thr 
225 230 235 240 

5 

Leu Arg Leu Tyr Asp lie Asn Thr Phe Gin Cys Phe Val Ser Cys Asn 
245 250 255 

Pro Gin Asp Gin His Thr Asp Ala lie Cys Ser Val Asn Tyr Asn Ser 
10 260 265 270 

Ser Ala Asn Met Tyr Val Thr Gly Ser Lys Asp Gly Cys lie Lys Leu 
275 280 285 

15 Trp Asp Gly Val Ser Asn Arg Cys lie Thr Thr Phe Glu Lys Ala His 

290 295 300 

Asp Gly Ala Glu Val Cys Ser Ala lie Phe Ser Lys Asn Ser Lys Tyr 
305 310 315 320 

20 

lie Leu Ser Ser Gly Lys Asp Ser Val Ala Lys Leu Trp Glu lie Ser 
325 330 335 

Thr Gly Arg Thr Leu Val Arg Tyr Thr Gly Ala Gly Leu Ser Gly Arg 
25 340 345 350 

Gin Val His Arg Thr Gin Ala Val Phe Asn His Thr Glu Asp Tyr Val 
355 360 365 

3 0 Leu Leu Pro Asp Glu Arg Thr lie Ser Leu Cys Cys Trp Asp Ser Arg 

370 375 380 

Thr Ala Glu Arg Arg Asn Leu Leu Ser Leu Gly His Asn Asn lie Val 
385 390 395 400 



35 



Arg Cys lie Val His Ser Pro Thr Asn Pro Gly Phe Met Thr Cys Ser 
405 410 415 



INFORMATION FOR SEQ ID NO : 3 8 : 
'i^ SEQUENCE CHARACTERISTICS: 
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(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

5 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
10 (C) INDIVIDUAL ISOLATE: G-Beta 1 bovine, Fig. 21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

15 Met Ser Glu Leu Asp Gin Leu Arg Gin Glu Ala Glu Gin Leu Lys Asn 

15 10 15 

Gin He Arg Asp Ala Arg Lys Ala Cys Ala Asp Ala Thr Leu Ser Gin 
20 25 30 

20 

He Thr Asn Asn He Asp Pro Val Gly Arg He Gin Met Arg Thr Arg 
35 40 45 

Arg Thr Leu Arg Gly His Leu Ala Lys He Tyr Ala Met His Trp Gly 
25 50 55 60 

Thr Asp Ser Arg Leu Leu Val Ser Ala Ser Gin Asp Gly Lys Leu lie 
65 70 75 80 

30 He Trp Asp Ser Tyr Thr Thr Asn Lys Val His Ala He Pro Leu Arg 

85 90 95 

Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Tyr Val 
100 105 110 



35 



Ala Cys Gly Gly Leu Asp Asn He Cys Ser He Tyr Asn Leu Lys Thr 
115 120 125 



Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gin He Val Thr Ser 
145 150 155 160 
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Thr Thr Thr Phe Thr Gly His Thr Gly Asp Val Met Ser Leu Ser Leu 
180 185 190 

Ala Pro Asp Thr Arg Leu Phe Val Ser Gly Ala Cys Asp Ala Ser Ala 
5 195 200 205 

Lys Leu Trp Asp Val Arg Glu Gly Met Cys Arg Gin Thr Phe Thr Gly 
210 215 220 

10 His Glu Ser Asp lie Asn Ala lie Cys Phe Phe Pro Asn Gly Asn Ala 

225 230 235 240 

Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp Leu Arg 
245 250 255 

15 

Ala Asp Gin Glu Leu Met Thr Tyr Ser His Asp Asn lie lie Cys Gly 
260 265 270 

lie Thr Ser Val Ser Phe Ser Lys Ser Gly Arg Leu Leu Leu Ala Gly 
20 275 280 285 

Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Ala Leu Lys Ala Asp Arg 
290 295 300 

2 5 Ala Gly Val Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu Gly Val 

305 310 315 320 

Thr Asp Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu 

325 330 335 

30 

Lys lie Trp Asn 
340 

(2) INFORMATION FOR SEQ ID NO: 39: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 amino acids 

(B) TYPE: amino acid 



(lil) HYPOTHETICAL: NO 



WO 95/21252 



PCT/US95/01210 



- 104 - 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta- bovine (2), Fig. 22 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 

Arg Asn Gin lie Arg Asp Ala Arg Lys Ala Cys Gly Asp Ser Thr Leu 
15 10 15 

10 Thr Gin lie Thr Ala Gly Leu Asp Pro Val Gly Arg He Gin Met Arg 

20 25 30 

Thr Arg Arg Thr Leu Arg Gly His Leu Ala Lys He Tyr Ala Met His 
35 40 45 

15 

Trp Gly Thr Asp Ser Arg Leu Leu Val Ser Ala Ser Gin Asp Gly Lys 
50 55 60 

Leu He He Trp Asp Ser Glu Gly Asn Val Arg Tyr Thr Thr Asn Lys 
20 65 70 75 80 

Val His Ala He Pro Leu Arg Ser Ser Trp Val Met Thr Cys Ala Tyr 
85 90 95 

2 5 Ala Pro Ser Gly Asn Phe Val Ala Cys Gly Gly Leu Asp Asn He Cys 

100 105 110 

Ser He Tyr Ser Leu Lys Thr Arg Val Ser Arg Glu Leu Pro Gly His 
115 120 125 

30 

Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gin lie He 
130 135 140 

Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp He Glu Thr Gly 
35 145 150 155 160 

Gin Gin Thr Val Gly Phe Ala Gly His Ser Gly Asp Val Met Ser Leu 
165 170 



Ser He Lys Leu Trp Asp Val Arg Asp Ser Met Cys Arg Gin Thr Phe 

195 7 HO ^ nc; 
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210 215 220 

Tyr Ala Phe Thr Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp 
225 230 235 240 

5 

Leu Arg Ala Asp Gin Glu Leu Leu Met Tyr Ser His Asp Asn lie lie 
245 250 255 

Cys Gly lie Thr Ser Val Ala Phe Ser Arg Ser Gly Arg Leu Leu Leu 
10 260 265 270 

Ala Gly Tyr Asp Asp Phe Asn Cys Asn He Trp Asp Ala Met Lys Gly 
275 280 285 

15 Asp Arg Ala Gly Val Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu 

290 295 300 

Gly Val Thr Asp Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser 
305 310 315 320 



20 



25 



30 



Phe Leu Lys He Trp Asn 
325 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE : G- BETA DROSOPH Fin ?i 



Met Asn Glu Leu Asp Ser Leu Arg Gin Glu Ala Glu Ser Leu Lys Asn 

1 5 
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20 25 30 

Ala Ala Thr Ser Leu Glu Pro He Gly Arg He Gin Met Arg Thr Arg 
35 40 45 

5 

Arg Thr Leu Arg Gly His Leu Ala Lys He Tyr Ala Met His Trp Gly 
50 55 60 

Asn Asp Ser Arg Asn Leu Val Ser Ala Ser Gin Asp Gly Lys Leu lie 
10 65 70 75 80 

Val Trp Asp Ser His Thr Thr Asn Lys Val His Ala He Pro Leu Arg 
85 90 95 

15 Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Ser Tyr Val 

100 105 110 

Ala Cys Gly Gly Leu Asp Asn Met Cys Ser He Tyr Asn Leu Lys Thr 
115 120 125 

20 

Arg Glu Gly Asn Val Arg Val Ser Arg Glu Leu Pro Gly His Gly Gly 
130 135 140 

Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gin He Val Thr Ser 
25 145 150 155 160 

Ser Gly Asp Met Ser Cys Gly Leu Trp Asp He Glu Thr Gly Leu Gin 
165 170 175 

3 0 Val Thr Ser Phe Leu Gly His Thr Gly Asp Val Met Ala Leu Ser Leu 

180 185 190 

Ala Pro Gin Cys Lys Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ala 
195 200 205 



35 



Lys Leu Trp Asp He Arg Glu Gly Val Cys Lys Gin Thr Phe Pro Gly 
210 215 220 



't : he Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp He Arg 
245 250 255 
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lie Thr Ser Val Ala Phe Ser Lys Ser Gly Arg Leu Leu Leu Ala Gly 
275 280 285 

Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Thr Met Lys Ala Glu Arg 
5 290 295 300 

Ser Gly lie Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu Gly Val 
305 310 315 320 

10 Thr Glu Asn Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu 



Arg Val Trp Asn 
340 



15 



(2) INFORMATION FOR SEQ ID NO: 41: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

25 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (C) INDIVIDUAL ISOLATE: G-BETA HUMAN , Fig. 24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

35 Met Thr Glu Gin Met Thr Leu Arg Gly Thr Leu Lys Gly His Asn Gly 

15 10 15 



35 



40 



45 
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Phe Val Ser Asp Val Val He Ser Ser' Asp Gly Gin Phe Ala Leu Ser 
65 70 75 80 

Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp Leu Thr Thr Gly Thr 
85 90 9$ 

Thr Thr Arg Arg Phe Val Gly His Thr Lys Asp Val Leu Ser Val Ala 
100 105 HO 



10 



Phe Ser Ser Asp Asn Arg Gin He Val Ser Gly Ser Arg Asp Lys Thr 
115 120 125 



15 



He Lys Leu Trp Asn Thr Leu Gly Val Cys Lys Tyr Thr Val Gin Asp 

130 135 140 

Glu Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser 

145 150 155 160 



20 



Ser Asn Pro He He Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val 
165 170 175 



Trp Asn Leu Ala Asn Cys Lys Leu Lys Thr Asn His He Gly His Thr 
180 185 190 



25 



Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu Cys Ala 
195 200 205 



30 



Ser Gly Gly Lys Asp Gly Gin Ala Met Leu Trp Asp Leu Asn Glu Gly 

210 215 220 

Lys His Leu Tyr Thr Leu Asp Gly Gly Asp He He Asn Ala Leu Cys 

225 230 235 240 



35 



Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser He 
245 250 255 



Lys He Trp Asp Leu Glu Gly Lys He He Val Asp Glu Leu Lys Gin 
260 265 270 



Leu Ala Trp Ser Ala Asp Gly Gin Thr Leu Phe Ala Gly Tyr Thr Asp 
Tqn 295 300 
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305 310 315 

(2) INFORMATION FOR SEQ ID NO: 42: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 340 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL : NO 

(iv) ANTI- SENSE: NO 



15 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta 2 (Human), Fig. 25 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Met Ser Glu Leu Glu Gin Leu Arg Gin Glu Ala Glu Gin Leu Arg Asn 
15 10 15 

25 Gin lie Arg Asp Ala Arg Lys Ala Cys Gly Asp Ser Thr Leu Thr Gin 

20 25 30 

He Thr Ala Gly Leu Asp Pro Val Gly Arg He Gin Met Arg Thr Arg 

35 40 45 

30 

Arg Thr Leu Arg Gly His Leu Ala Lys He Tyr Ala Met His Trp Gly 
50 55 60 

Thr Asp Ser Arg Leu Leu Val Ser Ala Ser Gin Asp Gly Lys Leu He 
35 65 70 75 80 

He Trp Asp Ser Tyr Thr Thr Asn Lys Val His Ala He Pro Leu Arg 



Ala Cys Gly Gly Leu Asp Asn He Cys Ser He Tyr Ser Leu Lys Thr 
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- 110 
135 



140 



Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gin lie lie Thr Ser 

145 150 155 160 

Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp lie Glu Thr Gly Gin Gin 

165 170 175 



10 



Thr Val Gly Phe Ala Gly His Ser Gly Asp Val Met Ser Leu Ser Leu 
180 185 190 



Ala Pro Asp Gly Arg Thr Phe Val Ser Gly Ala Cys Asp Ala Ser He 
195 200 205 



15 



Lys Leu Trp Asp Val Arg Asp Ser Met Cys Arg Gin Thr Phe He Gly 
210 215 220 



20 



His Glu Ser Asp He Asn Ala Val Ala Phe Phe Pro Asn Gly Tyr Ala 
225 230 235 240 

Phe Thr Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp Leu Arg 
245 250 255 



25 



Ala Asp Gin Glu Leu Leu Met Tyr Ser His Asp Asn He He Cys Gly 
260 265 270 



He Thr Ser Val Ala Phe Ser Arg Ser Gly Arg Leu Leu Leu Ala Gly 
275 280 285 



3 0 Tyr Asp Asp Phe Asn 

290 

Ala Gly Val Leu Ala 

305 

35 

Thr Asp Asp Gly Met 
325 



Cys Asn He Trp Asp Ala 
295 

Gly His Asp Asn Arg Val 
310 315 

Ala Val Ala Thr Gly Ser 
330 



Met Lys Gly Asp Arg 
300 

Ser Cys Leu Gly Val 
320 

Trp Asp Ser Phe Leu 

335 



[2) INFORMATION FOR SEQ ID NO: 43 
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(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : protein 

5 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
10 (C) INDIVIDUAL ISOLATE: G-Beta 4 (mouse), Fig. 26 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

15 Lys Lys Asx Glu Thr Asx Val Asn Met Gly Arg Tyr Thr Pro Arg lie 

15 10 15 

Lys His lie Lys Arg Pro Arg Arg Thr Asp Xaa Xaa Gly 

20 25 



20 



(2) INFORMATION FOR SEQ ID N0:44: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

3 0 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (C) INDIVIDUAL ISOLATE: GROUCHO PROTEIN DROSOPH, Fig. 27 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Gin Gly Pro lie Lys Phe Thr lie Ala Asp Thr Leu Glu Arg He Lys 
20 25 30 
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40 



45 



Cys Glu Lys Leu Ser Asn Glu Lys Thr Glu Met Gin Arg His Tyr Val 
50 55 60 

Met Tyr Tyr Glu Met Ser Tyr Gly Leu Asn Val Glu Met His Lys Gin 
65 70 75 80 



10 



Thr Glu He Ala Lys Arg Leu Asn Thr Leu He Asn Gin Leu Leu Pro 
85 90 95 



Phe Leu Gin Ala Asp His Gin Gin Gin Val Leu Gin Ala Val Glu Arg 
100 105 HO 



15 



Ala Lys Gin Val Thr Met Gin Glu Leu Asn Leu He He Gly Gin Gin 
115 120 125 



20 



He His Ala Gin Gin Val Pro Gly Gly Pro Pro Gin Pro Met Gly Ala 
130 135 140 

Leu Asn Pro Phe Gly Ala Leu Gly Ala Thr Met Gly Leu Pro His Gly 
145 150 155 160 



25 



Pro Gin Gly Leu Leu Asn Lys Pro Pro Glu His His Arg Pro Asp He 
165 170 175 



Lys Pro Thr Gly Leu Glu Gly Pro Ala Ala Ala Glu Glu Arg Leu Arg 
180 185 190 



30 



Asn Ser Val Ser Pro Ala Asp Arg Glu Lys Tyr Arg Thr Arg Ser Pro 
195 200 205 



35 



Leu Asp He Glu Asn Asp Ser Lys Arg Arg Lys Asp Glu Lys Leu Gin 
210 215 220 

Glu Asp Glu Gly Glu Lys Ser Asp Gin Asp Leu Val Val Asp Val Ala 

225 230 235 240 



Met Glu Val Arg Asp Arg Glu Ser Leu Asn Gly Glu Arg Leu Glu Lys 
260 265 270 
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Ser Ser Ser Ser Arg Ser Thr Pro Ser Leu Lys Thr Lys Asp Met Glu 
290 295 300 

Lys Pro Gly Thr Pro Gly Ala Lys Ala Arg Thr Pro Thr Pro Asn Ala 
5 305 310 315 320 

Ala Ala Pro Ala Pro Gly Val Asn Pro Lys Gin Met Met Pro Gin Gly 
325 330 335 

10 Pro Pro Pro Ala Gly Tyr Pro Gly Ala Pro Tyr Gin Arg Pro Ala Asp 

340 345 350 

Pro Tyr Gin Arg Pro Pro Ser Asp Pro Ala Tyr Gly Arg Pro Pro Pro 
355 360 365 

15 

Met Pro Tyr Asp Pro His Ala His Val Arg Thr Asn Gly He Pro His 
370 375 380 

Pro Ser Ala Leu Thr Gly Gly Lys Pro Ala Tyr Ser Phe His Met Asn 
20 385 390 395 400 

Gly Glu Gly Ser Leu Gin Pro Val Pro Phe Pro Pro Asp Ala Leu Val 
405 410 415 

2 5 Gly Val Gly He Pro Arg His Ala Arg Gin He Asn Thr Leu Ser His 

420 425 430 

Gly Glu Val Val Cys Ala Val Thr He Ser Asn Pro Thr Lys Tyr Val 
435 440 445 

30 

Tyr Thr Gly Gly Lys Gly Cys Val Lys Val Trp Asp He Ser Gin Pro 
450 455 460 

Gly Asn Lys Asn Pro Val Ser Gin Leu Asp Cys Leu Gin Arg Asp Asn 
35 465 470 475 480 

Tyr lie Arg Ser Val Lys Leu Leu Pro Asp Gly Arg Thr Leu He Val 
485 490 495 



Pro Arg He Lys Ala Glu Leu Thr Ser Ala Ala Pro Ala Cys Tyr Ala 
515 520 525 
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Asn He Ala Val Trp 

545 

5 

Gin Gly His Thr Asp 
565 



- 114 - 
535 

Asp Leu His Asn Glu He 
550 555 

Gly Ala Ser Cys He Asp 
570 



540 

Leu Val Arg Gin Phe 
560 

He Ser Pro Asp Gly 
575 



Ser Arg Leu Trp Thr Gly Gly Leu Asp Asn Thr Val Arg Ser Trp Asp 
10 580 585 590 

Leu Arg Glu Gly Arg Gin Leu Gin Gin His Asp Phe Ser Ser Gin lie 

595 600 605 

15 Phe Ser Leu Gly Tyr Cys Pro Thr Gly Asp Trp Leu Ala Val Gly Met 

610 615 620 



Glu Asn Ser His Val Glu Val Leu His Ala Ser Lys Pro Asp Lys Tyr 
625 630 635 640 

20 

Gin Leu His Leu His Glu Ser Cys Val Leu Ser Leu Arg Phe Ala Ala 
645 650 655 



Cys Gly Lys Trp Phe Val Ser Thr Gly Lys Asp Asn Leu Leu Asn Ala 
25 660 665 670 

Trp Arg Thr Pro Tyr Gly Ala Ser He Phe Gin Ser Lys Glu Thr Ser 
675 680 685 

30 Ser Val Leu Ser Cys Asp He Ser Thr Asp Asp Lys Tyr He Val Thr 

690 695 700 



Gly Ser Gly Asp Lys Lys Ala Thr Val Tyr Glu Val He Tyr 
705 710 715 



35 



(2) INFORMATION FOR SEQ ID NO: 45: 



( D ) TOPOLOGY : unknown 
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(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GTP binding protein (squid) , Fig. 28 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Thr Ser Glu Leu Glu Ala Leu Arg Gin Glu Thr Glu Gin Leu Lys 
1 5 10 15 ' 

10 

a pin Gin He Arc* Glu Ala Arc* Lys Ala Ala Ala Asp Thr Thr Leu Ala 
20 25 30 

Met Ala Thr Ala Asn Val Glu Pro Val Gly Arg He Gin Met Arg Thr 
15 35 40 45 

Arg Arg Thr Leu Arg Gly His Leu Ala Lys He Tyr Ala Met His Trp 
50 55 60 

20 Ala Ser Asp Ser Arg Asn Leu Val Ser Ala Ser Gin Asp Gly Lys Leu 

65 70 75 80 

He Val Trp Asp Gly Tyr Thr Thr Asn Lys Val His Ala lie Pro Leu 
85 90 95 

25 

Arg Ser Ser Trp Val Met Thr Cys Ala Tyr Ala Pro Ser Gly Asn Tyr 
100 105 110 

Val Ala Cys Gly Gly Leu Asp Asn lie Cys Ser He Tyr Ser Leu Lys 
30 115 120 125 

Thr Arg Glu Gly Asn Val Arg Val Ser Arg Glu Leu Pro Gly His Thr 
130 135 140 

35 Gly Tyr Leu Ser Cys Cys Arg Phe He Asp Asp Asn Gin He Val Thr 

145 150 155 160 

Ser Ser Olv Asp Met Thr fvs M * T pu tt- p ti ^ n.i ^ rjv n<?^ 



180 



185 



190 
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Ala Lys Leu Phe Asp lie Arg Asp Gly lie Cys Lys Gin Thr Phe Thr 

210 215 220 

Gly His Glu Ser Asp lie Asn Ala lie Thr Tyr Phe Pro Asn Gly Phe 
5 225 230 235 240 

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp lie 
245 250 255 

10 Arg Ala Asp Gin Glu He Gly Met Tyr Ser His Asp Asn He He Cys 

260 265 270 

Gly He Thr Ser Val Ala Phe Ser Lys Ser Gly Arg Leu Leu Leu Gly 
275 280 285 



15 



Gly Tyr Asp Asp Phe Asn Cys Asn Val Trp Asp Val Leu Lys Gin Glu 
290 295 300 



Arg Ala Gly Val Leu Ala Gly His Asp Asn Arg Val Ser Cys Leu Gly 
20 305 310 315 320 

Val Thr Glu Asp Gly Met Ala Val Ala Thr Gly Ser Trp Asp Ser Phe 
325 330 335 

25 Leu Lys He Trp Asn 

340 3 

(2) INFORMATION FOR SEQ ID NO: 46: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

35 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 



(C) INDIVIDUAL ISOLATE: IEF SSP 9306, Fig. 29 
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Met Ala Asp Lys Glu Ala Ala Phe Asp Asp Ala Val Glu Glu Arg Val 
15 10 15 

lie Asn Glu Glu Tyr Lys lie Trp Lys Lys Asn Thr Pro Phe Leu Tyr 
20 25 30 

Asp Leu Val Met Thr His Ala Leu Glu Trp Pro Ser Leu Thr Ala Gin 

35 40 45 



10 



Trp Leu Pro Asp Val Thr Arg Pro Glu Gly Lys Asp Phe Ser lie His 
50 55 60 



15 



Arg Leu Val Leu Gly Thr His Thr Ser Asp Glu Gin Asn His Leu Val 

65 70 75 80 

lie Ala Ser Val Gin Leu Pro Asn Asp Asp Ala Gin Phe Asp Ala Ser 
85 90 95 



20 



His Tyr Asp Ser Glu Lys Gly Glu Phe Gly Gly Phe Gly Ser Val Ser 
100 105 110 



Gly Lys He Glu He Glu He Lys He Asn His Glu Gly Glu Val Asn 
115 120 125 



25 



Arg Ala Arg Tyr Met Pro Gin Asn Pro Cys He He Ala Thr Lys Thr 
130 135 140 



30 



Pro Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ser Lys 

145 150 155 160 

Pro Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly His 
165 170 175 



35 



Gin Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser Gly His 
180 185 190 



Leu Leu Ser Ala Ser Asp Asp His Thr He Cys Leu Trp Asp He Ser 

195 200 205 



Gly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Giu 

225 230 235 240 
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245 ' 250 255 

Thr Arg Ser Asn Asn Thr Ser Lys Pro Ser His Ser Val Asp Ala His 
260 265 270 

5 

Thr Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe lie 
275 280 285 

Leu Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Trp Asp Leu Arg 
10 290 295 300 

Asn Leu Lys Leu Lys Leu His Ser Phe Glu Ser His Lys Asp Glu lie 
305 310 315 320 

15 Phe Gin Val Gin Trp Ser Pro His Asn Glu Thr He Leu Ala Ser Ser 

325 330 335 

Gly Thr Asp Arg Arg Leu Asn Val Trp Asp Leu Ser Lys He Gly Glu 
340 345 350 

20 

Glu Gin Ser Pro Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe 
355 360 365 

He His Gly Gly His Thr Ala Lys He Ser Asp Phe Ser Trp Asn Pro 
25 370 375 380 

Asn Glu Pro Trp Val He Cys Ser Val Ser Glu Asp Asn lie Met Gin 
385 390 395 400 

3 0 Val Trp Gin Met Glu Leu Val Leu Asp His 

405 410 

(2) INFORMATION FOR SEQ ID NO: 47: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 amino acids 

(B) TYPE: amino acid 



[ill! HYPOTHETICAL: NO 
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(C) INDIVIDUAL ISOLATE: HUMAN 12.3, Fig. 30 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

5 

Met Thr Glu Gin Met Thr Leu Arg Gly Thr Leu Lys Gly His Asn Gly 
15 10 15 

Trp Val Thr Gin lie Ala Thr Thr Pro Gin Phe Pro Asp Met lie Leu 
10 20 25 30 

Ser Ala Ser Arg Asp Lys Thr lie lie Met Trp Lys Leu Thr Arg Asp 
35 40 45 

15 Glu Thr Asn Tyr Gly lie Pro Gin Arg Ala Leu Arg Gly His Ser His 

50 55 60 

Phe Val Ser Asp Val Val lie Ser Ser Asp Gly Gin Phe Ala Leu Ser 
65 70 75 80 

20 

Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp Leu Thr Thr Gly Thr 
85 90 95 

Thr Thr Arg Arg Phe Val Gly His Thr Lys Asp Val Leu Ser Val Ala 
25 100 105 110 

Phe Ser Ser Asp Asn Arg Gin lie Val Ser Gly Ser Arg Asp Lys Thr 
115 120 125 

3 0 lie Lys Leu Trp Asn Thr Leu Gly Val Cys Lys Tyr Thr Val Gin Asp 

130 135 140 

Glu Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser 
145 150 155 160 



35 



Ser Asn Pro lie lie Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val 
165 170 175 



Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser Leu Cys Ala 
195 200 205 
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Lys His Leu Tyr Thr Leu Asp Gly Gly Asp lie lie Asn Ala Leu Cys 

225 230 235 240 

Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser He 
5 245 250 255 

Lys He Trp Asp Leu Glu Gly Lys He He Val Asp Glu Leu Lys Gin 

260 265 270 

10 Glu Val He Ser Thr Ser Ser Lys Ala Glu Pro Pro Gin Cys Thr Ser 

275 280 285 

Leu Ala Trp Ser Ala Asp Gly Gin Thr Leu Phe Ala Gly Tyr Thr Asp 
290 295 300 



15 



20 



25 



35 



Asn Leu Val Arg Val Trp Gin Val Thr He Gly Thr Arg 
305 310 315 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: IEF -7442 - human, Fig. 31 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Met Ala Ser Lvs Gin Mef Pho nin App Thr Vai t^- *n , 



20 25 30 
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Leu Pro Glu Val Thr Lys Pro Glu Gly Lys Asp Tyr Ala Leu His Trp 
50 55 60 

Leu Val Leu Gly Thr His Thr Ser Asp Glu Gin Asn His Leu Val Val 
5 65 70 75 80 

Ala Arg Val His He Pro Asn Asp Asp Ala Gin Phe Asp Ala Ser His 

85 90 95 

10 Cys Asp Ser Asp Lys Gly Glu Phe Gly Gly Phe Gly Ser Val Thr Gly 

100 105 110 

Lys He Glu Cys Glu He Lys He Asn His Glu Gly Glu Val Asn Arg 
115 120 125 

15 

Ala Arg Tyr Met Pro Gin Asn Pro His He He Ala Thr Lys Thr Pro 
130 135 140 

Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ala Lys Pro 
20 145 150 155 160 

Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly His Gin 
165 170 175 

25 Lys Glu Gly Tyr Gly Leu Ser Trp Asn Ser Asn Leu Ser Gly His Leu 

180 185 190 

Leu Ser Ala Ser Asp Asp His Thr Val Cys Leu Trp Asp He Asn Ala 

195 200 205 

30 

Gly Pro Lys Glu Gly Lys He Val Asp Ala Lys Ala He Phe Thr Gly 
210 215 220 

His Ser Ala Val Val Glu Asp Val Ala Trp His Leu Leu His Glu Ser 
35 225 230 235 240 

Leu Phe Gly Ser Val Ala Asp Asp Gin Lys Leu Met He Trp Asp Thr 
24^ 



Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe He Leu 
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290 295 300 

Leu Lys Leu Lys Leu His Thr Phe Glu Ser His Lys Asp Glu lie Phe 
305 310 315 320 

5 

Gin Val His Trp Ser Pro His Asn Glu Thr lie Leu Ala Ser Ser Gly 
325 330 335 

Thr Asp Arg Arg Leu Asn Val Trp Asp Leu Ser Lys lie Gly Glu Glu 
10 340 345 350 

Gin Ser Ala Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe lie 

355 360 365 

15 His Gly Gly His Thr Ala Lys lie Ser Asp Phe Ser Trp Asn Pro Asn 

370 375 380 

Glu Pro Trp Val He Cys Ser Val Ser Glu Asp Asn He Met Gin He 
385 390 395 400 

20 

Trp Gin Met Ala Glu Asn He Tyr Asn Asp Glu Glu Ser Asp Val Thr 
405 410 415 

Thr Ser Glu Leu Glu Gly Gin Gly Ser 
25 420 425 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 60 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



35 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



protein complex, Fig. i'l 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 
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15 10 15 

Trp Val Ala Leu Gly Pro Arg Ser Leu Glu Gly Ala Asp Pro Gly Thr 

20 25 30 

5 

Pro Gly Glu Ala Glu Gly Pro Ala Cys Pro Ala Ala Cys Val Cys Ser 
35 40 45 

Tyr Asp Asp Asp Ala Asp Glu Leu Ser Val Phe Cys Ser Ser Arg Asn 
10 50 55 60 

Leu Thr Arg Leu Pro Asp Gly Val Pro Gly Gly Thr Gin Ala Leu Trp 
65 70 75 80 

15 Leu Asp Gly Asn Asn Leu Ser Ser Val Pro Pro Ala Ala Phe Gin Asn 

85 90 95 

Leu Ser Ser Leu Gly Phe Leu Asn Leu Gin Gly Gly Gin Leu Gly Ser 
100 105 110 

20 

Leu Glu Pro Gin Ala Leu Leu Gly Leu Glu Asn Leu Cys His Leu His 
115 120 125 

Leu Glu Arg Asn Gin Leu Arg Ser Leu Ala Leu Gly Thr Phe Ala His 
25 130 135 140 

Thr Pro Ala Leu Ala Ser Leu Gly Leu Ser Asn Asn Arg Leu Ser Arg 
145 150 155 160 

3 0 Leu Glu Asp Gly Leu Phe Glu Gly Leu Gly Ser Leu Trp Asp Leu Asn 

165 170 175 

Leu Gly Trp Asn Ser Leu Ala Val Leu Pro Asp Ala Ala Phe Arg Gly 
180 185 190 
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Leu Gly Ser Leu Arg Glu Leu Val Leu Ala Gly Asn Arg Leu Ala Tyr 
195 200 205 



Leu Ser Arg Asn Ala Leu Arg Ala He Lys Ala Asn Val Phe Val Gin 
225 230 235 240 
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Val Ala Pro Gly Ala Phe Leu Gly Leu Lys Ala Leu Arg Trp Leu Asp 
260 265 270 

Leu Ser His Asn Arg Val Ala Gly Leu Leu Glu Asp Thr Phe Pro Gly 
5 275 280 285 

Leu Leu Gly Leu Arg Val Leu Arg Leu Ser His Asn Ala lie Ala Ser 
290 295 300 

10 Leu Arg Pro Arg Thr Phe Lys Asp Leu His Phe Leu Glu Glu Leu Gin 

305 310 315 

Leu Gly His Asn Arg lie Arg Gin Leu Ala Glu Arg Ser Phe Glu Gly 

325 330 335 

15 

Leu Gly Gin Leu Glu Val Leu Thr Leu Asp His Asn Gin Leu Gin Glu 
340 345 350 

Val Lys Ala Gly Ala Phe Leu Gly Leu Thr Asn Val Ala Val Met Asn 
20 355 360 365 

Leu Ser Gly Asn Cys Leu Arg Asn Leu Pro Glu Gin Val Phe Arg Gly 
370 375 380 

25 Leu Gly Lys Leu His Ser Leu His Leu Glu Gly Ser Cys Leu Gly Arg 

385 390 395 400 

lie Arg Pro His Thr Phe Thr Gly Leu Ser Gly Leu Arg Arg Leu Phe 

405 410 415 

30 

Leu Lys Asp Asn Gly Leu Val Gly He Glu Glu Gin Ser Leu Trp Gly 
420 425 430 

Leu Ala Glu Leu Leu Glu Leu Asp Leu Thr Ser Asn Gin Leu Thr His 
35 435 440 445 

Leu Pro His Arg Leu Phe Gin Gly Leu Gly Lys Leu Glu Tyr Leu Leu 



Leu Gin Arg Ala Phe Trp Leu Asp Val Ser His Asn Arg Leu Glu Ala 
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500 505' 510 

Leu Arg Asn Asn Ser Leu Arg Thr Phe Thr Pro Qln Pro Pro Gly Leu 

515 520 525 

5 

Glu Arg Leu Trp Leu Glu Gly Asn Pro Trp Asp Cys Gly Cys Pro Leu 
530 535 540 

Lys Ala Leu Arg Asp Phe Ala Leu Gin Asn Pro Ser Ala Val Pro' Arg 
10 545 550 555 560 

Phe Val Gin Ala lie Cys Glu Gly Asp Asp Cys Gin Pro Pro Ala Tyr 
565 570 575 

15 Thr Tyr Asn Asn lie Thr Cys Ala Ser Pro Pro Glu Val Val Gly Leu 

580 585 590 

Asp Leu Arg Asp Leu Ser Glu Ala His Phe Ala Pro Cys 
595 600 605 



20 



(2) INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 603 amino acids 

2 5 (B) TYPE : amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

3 0 <iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (C) INDIVIDUAL ISOLATE: Insulin-like growth factor bind. 

pro. complex-rat, Fig. 33 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 50 : 



Trp Val Ala Leu Gly Pro Cys His Leu Gin Gly Thr Asp Pro Gly Ala 

20 25 ^0 
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35 40 45 

His Asp Asp Tyr Thr Asp Glu Leu Ser Val Phe Cys Ser Ser Lys Asn 

50 55 60 

5 

Leu Thr His Leu Pro Asp Asp lie Pro Val Ser Thr Arg Ala Leu Trp 
65 70 75 80 

Leu Asp Gly Asn Asn Leu Ser Ser He Pro Ser Ala Ala Phe Gin Asn 
10 85 90 95 

Leu Ser Ser Leu Asp Phe Leu Asn Leu Gin Gly Ser Trp Leu Arg Ser 
100 105 110 

15 Leu Glu Pro Gin Ala Leu Leu Gly Leu Gin Asn Leu Tyr Tyr Leu His 

115 120 125 

Leu Glu Arg Asn Arg Leu Arg Asn Leu Ala Val Gly Leu Phe Thr His 
130 135 140 

20 

Thr Pro Ser Leu Ala Ser Leu Ser Leu Ser Ser Asn Leu Leu Gly Arg 
145 150 155 160 

Leu Glu Glu Gly Leu Phe Gin Gly Leu Ser His Leu Trp Asp Leu Asn 
25 165 170 175 

Leu Gly Trp Asn Ser Leu Val Val Leu Pro Asp Thr Val Phe Gin Gly 
180 185 190 

3 0 Leu Gly Asn Leu His Glu Leu Val Leu Ala Gly Asn Lys Leu Thr Tyr 

195 200 205 

Leu Gin Pro Ala Leu Phe Cys Gly Leu Gly Glu Leu Arg Glu Leu Asp 
210 215 220 
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Leu Ser Arg Asn Ala Leu Arg Ser Val Lys Ala Asn Val Phe Val His 
225 230 235 240 



Val Ala Pro Gly Ala Phe Leu Gly Met Lys Ala Leu Arg Trp Leu Asp 
260 265 270 
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Leu Leu Gly Leu His Val Leu Arg Leu Ala His Asn Ala lie Ala Ser 
290 295 300 

Leu Arg Pro Arg Thr Phe Lys Asp Leu His Phe Leu Glu Glu Leu Gin 
5 305 310 315 320 

Leu Gly His Asn Arg He Arg Gin Leu Gly Glu Arg Thr Phe Glu Gly 
325 330 335 

10 Leu Gly Gin Leu Glu Val Leu Thr Leu Asn Asp Asn Gin He Thr Glu 

340 345 350 

Val Arg Val Gly Ala Phe Ser Gly Leu Phe Asn Val Ala Val Met Asn 
355 360 365 

15 

Leu Ser Gly Asn Cys Leu Arg Ser Leu Pro Glu Arg Val Phe Gin Gly 

370 375 380 

Leu Asp Lys Leu His Ser Leu His Leu Glu His Ser Cys Leu Gly His 
20 385 390 395 400 

Val Arg Leu His Thr Phe Ala Gly Leu Ser Gly Leu Arg Arg Leu Phe 
405 410 415 

25 Leu Arg Asp Asn Ser He Ser Ser He Glu Glu Gin Ser Leu Ala Gly 

420 425 430 

Leu Ser Glu Leu Leu Glu Leu Asp Leu Thr Thr Asn Arg Leu Thr His 

435 440 445 

30 

Leu Pro Arg Gin Leu Phe Gin Gly Leu Gly His Leu Glu Tyr Leu Leu 
450 455 460 

Leu Ser Tyr Asn Gin Leu Thr Thr Leu Ser Ala Glu Val Leu Gly Pro 
35 465 470 475 480 

Leu Gin Arg Ala Phe Trp Leu Asp He Ser His Asn His Leu Glu Thr 

485 490 495 



Leu Arg Asn Asn Ser Leu Gin Thr Phe Ser Pro Gin Pro Gly Leu Glu 

515 520 525 
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530 535 540 

Ala Leu Arg Asp Phe Ala Leu Gin Asn Pro Gly Val Val Pro Arg Phe 
545 550 555 560 

5 

Val Gin Thr Val Cys Glu Gly Asp Asp Cys Gin Pro Val Tyr Thr Tyr 
565 570 575 

Asn Asn lie Thr Cys Ala Gly Pro Ala Asn Val Ser Gly Leu Asp Leu 
10 580 585 590 

Arg Asp Val Ser Glu Thr His Phe Val His Cys 
595 600 

15 (2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

25 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: LIS1 (human), Fig. 34 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Val Leu Ser Gin Arg Gin Arg Asp Glu Leu Asn Arg Ala lie Ala 
35 1 5 10 15 

Asp Tyr Leu Arg Ser Asn Gly Tyr Glu Glu Ala Tyr Ser Val Phe Lys 



Gly Leu Leu Glu Lys Lys Trp Thr Ser Val He Arg Leu Gin Lys Lys 
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65 70 75 80 

Ser Gly Gly Pro Leu Gly Gin Lys Arg Asp Pro Lys Glu Trp lie Pro 
85 90 95 

5 

Arg Pro Pro Glu Lys Tyr Ala Leu Ser Gly His Arg Ser Pro Val Thr 
100 105 110 

Arg Val He Phe His Pro Val Phe Ser Val Met Val Ser Ala Ser Glu 

10 115 120 125 

Asp Ala Thr He Lys Val Trp Asp Tyr Glu Thr Gly Asp Phe Glu Arg 
130 135 140 

15 Thr Leu Lys Gly His Thr Asp Ser Val Gin Asp He Ser Phe Asp His 

145 150 155 160 

Ser Gly Lys Leu Leu Ala Ser Cys Ser Ala Asp Met Thr He Lys Leu 
165 170 175 

20 

Trp Asp Phe Gin Gly Phe Glu Cys He Arg Thr Met His Gly His Asp 
180 185 190 

His Asn Val Ser Ser Val Ala He Met Pro Asn Gly Asp His He Val 
25 195 200 205 

Ser Ala Ser Arg Asp Lys Thr He Lys Met Trp Glu Val Gin Thr Gly 
210 215 220 

3 0 Tyr Cys Val Lys Thr Phe Thr Gly His Arg Glu Trp Val Arg Met Val 

225 230 235 240 

Arg Pro Asn Gin Asp Gly Thr Leu He Ala Ser Cys Ser Asn Asp Gin 
245 250 255 
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Thr Val Arg Val Trp Val Val Ala Thr Lys Glu Cys Lys Ala Glu Leu 
260 265 270 



Ser : y i Ser Ser lie Sei Glu Ala Thr Gly Ser Giu Thr I.ys Lys Ser 
290 295 300 
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Met Trp Asp Val Ser Thr Gly Met Cys Leu Met Thr Leu Val Gly His 
325 330 335 

Asp Asn Trp Val Arg Gly Val Leu Phe His Ser Gly Gly Lys Phe He 

5 340 345 350 

Leu Ser Cys Ala Asp Asp Lys Thr Leu Arg Val Trp Asp Tyr Lys Asn 
355 360 365 

10 Lys Arg Cys Met Lys Thr Leu Asn Ala His Glu His Phe Val Thr Ser 

370 375 380 

Leu Asp Phe His Lys Thr Ala Pro Tyr Val Val Thr Gly Ser Val Asp 
385 390 395 400 



15 



20 



Gin Thr Val Lys Val Trp Glu Cys Arg 
405 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



25 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MD6 , Fig. 3 5 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Glu Arg Lys Asp Phe Glu Thr Trp Leu Astd Asn Tl^ S^r Val Thr 



20 25 30 

Tie Ser Leu Ser Glv Ala Val Gin I.ou Ara H: n T.pv. So r Asr A^d t ^ 
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Glu Thr Leu Leu Lys Arg Asp Phe Leu Lys Leu Leu Pro Leu Glu Leu 
50 55 60 

Ser Phe Tyr Leu Leu Lys Trp Leu Asp Pro Gin Thr Leu Leu Thr Cys 
5 65 70 75 80 

Cys Leu Val Ser Lys Gin Arg Asn Lys Val He Ser Ala Cys Thr Glu 
85 90 95 

10 Val Trp Gin Thr Ala Cys Lys Asn Leu Gly Trp Gin He Asp Asp Ser 

100 105 HO 

Val Gin Asp Ser Leu His Trp Lys Lys Val Tyr Leu Lys Ala He Leu 
115 120 125 

15 

Arg Met Lys Gin Leu Glu Asp His Glu Ala Phe Glu Thr Ser Ser Leu 
130 135 140 

He Gly His Ser Ala Arg Val Tyr Ala Leu Tyr Tyr Lys Asp Gly Leu 
20 145 150 155 160 

Leu Cys Thr Gly Ser Asp Asp Leu Ser Ala Lys Leu Trp Asp Val Ser 
165 170 175 

25 Thr Gly Gin Cys Val Tyr Gly He Gin Thr His Thr Cys Ala Ala Val 

180 185 190 

Lys Phe Asp Glu Gin Lys Leu Val Thr Gly Ser Phe Asp Asn Thr Val 
195 200 205 

30 

Ala Cys Trp Glu Trp Ser Ser Gly Ala Arg Thr Gin His Phe Arg Gly 
210 215 220 

His Thr Gly Ala Val Phe Ser Val Asp Tyr Ser Asp Glu Leu Asp He 
35 225 230 235 240 

Leu Val Ser Gly Ser Ala Asp Phe Ala Val Lys Val Trp Ala Leu Ser 

245 250 7^ 



Lys Val Val Leu Gin Lys Cys Lys Val Lys Ser Leu Leu His Ser Pro 
275 280 
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290 295 300 

Pro lie Gly Arg Glu lie Asn Cys Lys Cys Leu Lys Thr Leu Ser Val 
305 310 315 320 

Ser Glu Asp Arg Ser lie Cys Leu Gin Pro Arg Leu His Phe Asp Gly 
325 330 335 

Lys Tyr lie Val Cys Ser Ser Ala Leu Gly Leu Tyr Gin Trp Asp Phe 
10 340 345 350 

Ala Ser Tyr Asp lie Leu Arg Val lie Lys Thr Pro Glu Val Ala Asn 
355 360 365 

15 Leu Ala Leu Leu Gly Phe Gly Asp Val Phe Ala Leu Leu Phe Asp Asn 

370 375 380 

His Tyr Leu Tyr lie Met Asp Leu Arg Thr Glu Ser Leu lie Ser Arg 
385 390 395 400 

20 

Trp Pro Leu Pro Glu Tyr Arg Lys Ser Lys Arg Gly Thr Ser Phe Leu 
405 410 415 

Ala Gly Glu Arg Pro Gly 
25 420 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

35 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 3 
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10 



15 



He Asp Leu Gin Glu Arg Tyr Ser His Trp Lys Lys Asn Thr Lys Leu 
20 25 30 



Leu Tyr Asp Tyr Leu Asn Thr Asn Ser Thr Lys Trp Pro Ser Leu Thr 
35 40 45 



Cys Gin Phe Phe 
10 50 

Leu Leu Ser Ser 
65 

15 Tyr He Ser Lys 



Asn Asn Phe Asp 
100 

20 

Arg Phe Pro Ser 
115 

Asn Gly Glu Cys 

25 130 

He Ala Gly Ala 
145 

3 0 Lys His Gly Ser 



Glu Thr Lys Leu 
180 

35 

Met Asp Thr Ser 
195 



Pro Asp Leu Asp Thr Thr 
55 

Phe Thr Ser Ser Gin Lys 
70 

He Ser Thr Leu Gly His 
85 90 

Met Asp Glu Met Glu Phe 
105 

Lys His Leu Val Asn Asp 
120 

Asn Arg ,Ala Arg Tyr Leu 
135 

Ser Ser Asp Gly Ala He 
150 

Thr Arg lie Arg Gin Ser 
165 170 

Phe Gly Ser His Gly Val 
185 

Ser Ala Asp lie Asn Glu 

200 



Ser Asp Glu His Arg He 
60 

Pro Glu Asp Glu Thr lie 
75 80 

He Lys Trp Ser Ser Leu 
95 

Lys Pro Glu Asn Ser Thr 
110 

lie Ser He Phe Phe Pro 
125 

Pro Gin Asn Pro Asp He 
140 

Tyr lie Phe Asp Arg Thr 
155 160 

Lys lie Ser His Pro Phe 
175 

lie Gin Asp Val Glu Ala 
190 

Ala Thr Ser Leu Ala Trp 

205 



Val Gin Val Trp Asp He Lys Gin Tyr Ser His Glu Asn Pro He He 
225 230 235 240 
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Val Thr Trp Met Pro Thr His Asp Ser Leu Phe Ala Ala Cys Thr Glu 
260 265 270 

Gly Asn Ala Val Ser Leu Leu Asp Leu Arg Thr Lys Lys Glu Lys Leu 

5 275 280 285 

Gin Ser Asn Arg Glu Lys His Asp Gly Gly Val Asn Ser Cys Arg Phe 
290 295 300 

10 Asn Tyr Lys Asn Ser Leu lie Leu Ala Ser Ala Asp Ser Asn Gly Arg 

305 "Un 315 320 

Leu Asn Leu Trp Asp lie Arg Asn Met Asn Lys Ser Pro lie Ala Thr 
325 330 335 

15 

Met Glu His Gly Thr Ser Val Ser Thr Leu Glu Trp Ser Pro Asn Phe 
340 345 350 

Asp Thr Val Leu Ala Thr Ala Gly Gin Glu Asp Gly Leu Val Lys Leu 
20 355 360 365 

Trp Asp Thr Ser Cys Glu Glu Thr lie Phe Thr His Gly Gly His Met 
370 375 380 

25 Leu Gly Val Asn Asp lie Ser Trp Asp Ala His Asp Pro Trp Leu Met 

385 390 395 400 

• Cys Ser Val Ala Asn Asp Asn Ser Val His lie Trp Lys Pro Ala Gly 
405 410 415 

30 

Asn Leu Val Gly His Ser 
420 

(2) INFORMATION FOR SEQ ID NO: 54: 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 amino acids 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MUS MUSCULUS PROTEIN, Fig. 37 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Phe Arg Met Asp Asn Ala Ser Thr Arg lie Asp Glu Arg Phe Arg lie 
15 10 15 

10 Asp Ala Tyr Ala Asn Ala Arg Tyr Pro Met Pro Arg Thr Glu lie Asn 

20 7* 30 

Ser Glu Gin Glu Asn Cys Glu Asn Thr lie Thr Leu Glu Asp Ser Glu 
35 40 45 

15 

Gin Glu Asn Cys Glu Ala Ala Cys Met Pro Leu Glu Thr Glu Ser Glu 

50 55 60 

Gin Glu Asn Cys Glu Met Ser Ser His Glu Ser Tyr Thr Asn Ala Ala 
20 65 70 75 80 

Glu Thr Pro Glu Asn lie Ser lie Leu Ser Cys Leu Gly Glu Thr Ser 
85 90 95 

2 5 Gly Ala Leu Val Asp Thr Lys Thr lie Ser Asp lie Lys Thr Met Asp 

100 105 110 

Pro Arg Val Ser Leu Thr Pro Ser Ser Asp Val Thr Gly Thr Glu Asp 
115 120 125 

30 

Ser Ser Val Leu Thr Pro Gin Ser Thr Asp Val Asn Ser Val Asp Ser 
130 135 140 

Tyr Gin Gly Tyr Glu Gly Asp Asp Asp Asp Glu Glu Asp Asp Glu Asp 
35 145 150 155 160 

Asp Lys Asp Gly Asp Ser Asn Leu Pro Ser Leu Glu Asp Ser Asp Asn 

165 170 



Gly Glu Val Val Glu Glu Gin Ser Leu Gly Arg Arg Phe His Pro Tyr 

195 200 ?0^ 
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210 



136 
215 



220 



Phe Tyr Pro Tyr Glu Leu Glu Ala Gly Glu Val Val Glu Ala Gin Asn 
225 230 235 240 

Val Gin Asn Leu Phe His Arg Tyr Glu Leu Glu Glu Gly Glu Val Val 
245 250 255 



10 



Glu Ala Gin Val Val Gin Ser Met Phe Pro Tyr Tyr Glu Leu Glu 'Ala 
260 265 270 



Gly Glu Val Val Glu Ala Glu Glu Val Gin Gly Phe Phe Gin Arg Tyr 
275 280 285 



15 



Glu Leu Glu Ala Arg Glu Val He Gly Ala Gin Gly Gly Gin Gly Leu 
290 295 300 



20 



Ser Arg His Tyr Gly Leu Glu Gly Gly Glu Val Val Glu Ala Thr Ala 
305 310 315 320 

Val Arg Arg Leu He Gin His His Glu Leu Glu Glu Gly Glu Asp Val 
325 330 335 



25 



Asp Asp Gin Glu Glu Ser Ser Glu Met His Glu Glu Thr Ser Glu Asp 
340 345 350 



Ser Ser Glu Gin Tyr Asp He Glu Asp Asp Ser Leu He Asp Glu Trp 

355 360 365 



30 



lie Ala Leu Glu Thr Ser Pro Leu Pro Arg Pro Arg Trp Asn Val Leu 
370 375 380 



35 



Ser Ala Leu Arg Asp Arg Gin Leu Gly Ser Ser Gly Arg Phe Val Tyr 
385 390 395 400 

Glu Ala Cys Gly Ala Arg Leu Phe Val Gin Arg Phe Ser Leu Glu His 
405 410 415 



Hxs Gly Thr Leu Leu Ala Ser Gly Ser Asp Asp Leu Lys Val lie Val 
435 440 445 
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Lys Asn Asn lie Leu Gin Ala Lys Phe Leu Pro Asn Cys Asn Asp Ala 
465 470 475 480 

lie Leu Ala Met Cys Gly Arg Asp Gly Gin Val Arg Val Ala Gin Leu 
5 485 490 495 

Ser Ala Val Ala Gly Thr His Met Thr Lys Arg Leu Val Lys His Gly 
500 505 510 

10 Gly Ala Ser His Arg Leu Gly Leu Glu Pro Asp Ser Pro Phe Arg Phe 

515 520 525 

Leu Thr Ser Gly Glu Asp Ala Val Val Phe Asn lie Asp Leu Arg Gin 
530 535 540 

15 

Ala His Pro Ala Ser Lys Leu Leu Val lie Lys Asp Gly Asp Lys Lys 
545 550 555 560 

Val Gly Leu Tyr Thr Val Phe Val Asn Pro Ala Asn Val Tyr Gin Phe 
20 565 570 575 

Ala Val Gly Gly Gin Asp Gin Phe Met Arg lie Tyr Asp Gin Arg Lys 
580 585 590 

| 

25 lie Asp Glu Asn Val Asn Asn Gly Val Leu Lys Lys Phe Cys Pro His 

595 600 605 

His Leu Leu Ser Ser Asp Tyr Pro Ala His lie Thr Ser Leu Met Tyr 

610 615 620 

30 

Ser Tyr Asp Gly Thr Glu lie Leu Ala Ser Tyr Asn Asp Glu Asp lie 
625 630 635 640 

Tyr lie Phe Asn Ser Ser Asp Ser Asp Gly Ala Gin Tyr Ala Lys Arg 
35 645 650 655 

Tyr Lys Gly His Arg Asn Asn Ser Thr Val Lys Gly Val Tyr Phe Tyr 

660 66*5 670 



Phe lie Trp Glu Lys Ser Ser Cys Gin lie Val Gin Phe Leu Glu Ala 

£QO £qt; 7H A 
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705 710 715 720 

Val Leu Ala Ser Ser Gly Leu Asp His Glu Val Lys lie Trp Ser Pro 
725 730 735 

5 

lie Ala Glu Pro Ser Lys Lys Leu Ala Gly Leu Lys Asn Val lie Lys 
740 745 750 

He Asn Lys Leu Lys Arg Asp Asn Phe Thr Leu Arg His Thr Ser Leu 
10 755 760 765 

Phe Asn Asn Ser Met Leu Cys Phe Leu Met Ser His Val Thr Gin Ser 
770 775 780 

15 Asn Tyr Gly Arg Ser Trp Arg Gly He Arg He Asn Ala Gly Gly Gly 

785 790 795 800 

Asp Phe Ser Asp Ser Ser Ser Ser Ser Glu Glu Thr Asn Gin Glu Ser 
805 810 815 

20 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 422 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

30 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

3 5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: ORF RBI , Fig. 3 8 
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Leu Tyr Asp Tyr Leu Asn Thr Asn Ser Thr Lys Trp Pro Ser Leu Thr 
35 40 45 

Cys Gin Phe Phe Pro Asp Leu Asp Thr Thr Ser Asp Glu His Arg lie 
5 50 55 60 

Leu Leu Ser Ser Phe Thr Ser Ser Gin Lys Pro Glu Asp Glu Thr lie 
65 70 75 80 

10 Tyr lie Ser Lys lie Ser Thr Leu Gly His lie Lys Trp Ser Ser Leu 

85 90 95 

Asn Asn Phe Asp Met Asp Glu Met Glu Phe Lys Pro Glu Asn Ser Thr 
100 105 110 

15 

Arg Phe Pro Ser Lys His Leu Val Asn Asp lie Ser lie Phe Phe Pro 
115 120 125 

Asn Gly Glu Cys Asn Arg Ala Arg Tyr Leu Pro Gin Asn Pro Asp lie 
20 130 135 140 

lie Ala Gly Ala Ser Ser Asp Gly Ala lie Tyr lie Phe Asp Arg Thr 
145 150 155 160 

25 Lys His Gly Ser Thr Arg lie Arg Gin Ser Lys lie Ser His Pro Phe 

165 170 175 

Glu Thr Lys Leu Phe Gly Ser His Gly Val lie Gin Asp Val Glu Ala 
180 185 190 

30 

Met Asp Thr Ser Ser Ala Asp lie Asn Glu Ala Thr Ser Leu Ala Trp 
195 200 205 

Asn Leu Gin Gin Glu Ala Leu Leu Leu Ser Ser His Ser Asn Gly Gin 
35 210 215 220 

Val Gin Val Trp Asp lie Lys Gin Tyr Ser His Glu Asn Pro lie lie 
225 230 235 240 



Val Thr Trp Met Pro Thr His Asp Ser Leu Phe Ala Ala Cys Thr Glu 
260 265 710 
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275 280 285 

Gin Ser Asn Arg Glu Lys His Asp Gly Gly Val Asn Ser Cys Arg Phe 
290 295 300 

5 

Asn Tyr Lys Asn Ser Leu lie Leu Ala Ser Ala Asp Ser Asn Gly Arg 
305 310 315 320 

Leu Asn Leu Trp Asp lie Arg Asn Met Asn Lys Ser Pro lie Ala Thr 
10 325 330 335 

Met Glu His Gly Thr Ser Val Ser Thr Leu Glu Trp Ser Pro Asn Phe 
340 345 350 

15 Asp Thr Val Leu Ala Thr Ala Gly Gin Glu Asp Gly Leu Val Lys Leu 

355 360 365 

Trp Asp Thr Ser Cys Glu Glu Thr lie Phe Thr His Gly Gly His Met 
370 375 380 

20 

Leu Gly Val Asn Asp lie Ser Trp Asp Ala His Asp Pro Trp Leu Met 
385 390 395 400 

Cys Ser Val Ala Asn ; Asp Asn Ser Val His lie Trp Lys Pro Ala Gly 
25 405 410 415 

Asn Leu Val Gly His Ser 
420 

30 (2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 amino acids 

(B) TYPE: amino acid 
35 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL TSOLATF- ^ - 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met He Ser Ala Thr Asn Trp Val Pro Arg Gly Phe Ser Ser Glu Phe 

15 10 15 

5 

Pro Glu Lys Tyr Val Leu Asp Asp Glu Glu Val Glu Arg He Asn Gin 
20 25 30 

Leu Ala Gin Leu Asn Leu Asp Asp Ala Lys Ala Thr Leu Glu Glu Ala 
10 35 40 45 

Glu Gly Glu Ser Gly Val Glu Asp Asp Ala Ala Thr Gly Ser Ser Asn 
50 55 60 

15 Lys Leu Lys Asp Gin Leu Asp He Asp Asp Asp Leu Lys Glu Tyr Asn 

65 70 75 80 

Leu Glu Glu Tyr Asp Asp Glu Glu He Ala Asp Asn Glu Gly Gly Lys 
85 90 95 

20 

Asp Val Ser Met Phe Pro Gly Leu Ser Asn Asp Ser Asp Val Lys Phe 
100 105 110 

His Glu Gly Glu Lys Gly Glu Asp Pro Tyr He Ser Leu Pro Asn Gin 
25 115 120 125 

Glu Asp Ser Gin Glu Glu Lys Gin Glu Leu Gin Val Tyr Pro Ser Asp 
130 135 140 

3 0 Asn Leu Val Leu Ala Ala Arg Thr Glu Asp Asp Val Ser Tyr Leu Asp 

145 150 155 160 

He Tyr Val Tyr Asp Asp Gly Ala Gly Phe His Ser Ser Asp He Pro 
165 170 175 



35 



Val Glu Glu Gly Asp Glu Ala Asp Pro Asp Val Ala Arg Gly Leu Val 
180 185 190 



Pro Leu Cys Vai G^u Trp Leu Asp Tyr Lys Val Gly Ser Asn Ser Glu 
210 215 220 



WO 95/21252 



PCT/US95/01210 



- 142 - 

He Trp Asn Leu Asp Cys Val Asp Lys Ala Phe Pro Asp Met He Leu 
245 250 255 

Gly Glu Pro Leu Asp Asn Ser Met Val Ser Leu Lys Ser Lys Lys Lys 
260 265 270 

Lys Lys Lys Ser Lys Thr Gly His He Thr Thr His His Thr Asp Ala 
275 280 285 



10 



Val Leu Ser Met Ala His Asn Lys Tyr Phe Arg Ser Val Leu Ala Ser 
290 295 300 



15 



Thr Ser Ala Asp His Thr Val Lys Leu Trp Asp Leu Asn Ser Gly Asn 

305 310 315 320 

Ala Ala Arg Ser Leu Ala Ser He His Ser Asn Lys Asn Val Ser Ser 

325 330 335 



20 



Ser Glu Trp His Met Leu Asn Gly Ser He Leu Leu Thr Gly Gly Tyr 
340 345 350 



Asp Ser Arg Val Ala Leu Thr Asp Val Arg He Ser Asp Glu Ser Gin 
355 360 365 



25 



Met Ser Lys Tyr Trp Ser Ala Met Ala Gly Glu Glu He Glu Thr Val 

370 375 380 



30 



Thr Phe Ala Ser Glu Asn He He Leu Cys Gly Thr Asp Ser Gly Asn 

385 390 395 400 

Val Tyr Ser Phe Asp He Arg Asn Asn Glu Asn Arg Lys Pro Val Trp 
405 410 415 



35 



Thr Leu Lys Ala His Asp Ala Gly He Ser Thr Leu Cys Ser Asn Lys 
420 425 430 



Phe He Pro Gly Met Met Ser Thr Gly Ala Met Gly Glu Lys Thr Val 
435 440 44^ 



Ser Met Val Leu Ser Arg Asp Phe Asp Val Gly Asn Val Leu Thr Ser 
46^ 4"?0 4 ^ ^ 4RO 
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485 



490 



495 



Val Asn Lys Val Leu Lys Leu Trp Asp Val Phe Thr Asn Arg Ser Val 

500 505 510 



Arg Lys Ser Phe Lys Ser Glu Leu Glu Asn Val Gin Ala Arg Ala Lys 
515 520 525 



Glu Glu Ala Gin Lys He Gly Lys Ser Ser Arg He Ala Arg Lys Tyr 
10 530 535 540 

Thr Ser Asn Asp Asn Pro Asp Thr Val He Thr He Asp Asp Gin Gly 
545 550 555 560 

15 Glu Asp Glu Glu Glu Arg Glu Gly Gly Asp Glu His Asp Asp Met Ala 

565 570 575 



(2) INFORMATION FOR SEQ ID NO: 57: 

20 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

25 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

3 0 (iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: PLAP, Fig. 4 0 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met His Tyr Met Ser Gly His Ser Asn Ph^ Val P^r Tvr Ya'! T ^ 



20 25 30 

Asn Asp His Asn Hp Cvs Tie Phe Ser T.pt: Anr S^>- Pr^ *** ° *~ T o> 
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Tyr lie Leu Lys Gly His Lys Asp Thr Val Cys Ser Leu Ser Ser Gly 
50 55 60 

Lys Phe Gly Thr Leu Leu Ser Gly Ser Trp Asp Thr Thr Ala Lys Val 
5 65 70 75 80 

Trp Leu Asn Asp Lys Cys Met Met Thr Leu Gin Gly His Thr Ala Ala 
85 90 95 

10 Val Trp Ala Val Lys lie Leu Pro Glu Gin Gly Leu Met Leu Thr Gly 

100 105 110 

Ser Ala Asp Lys Thr lie Lys Leu Trp Lys Ala Gly Arg Cys Glu Arg 
115 120 125 

15 

Thr Phe Leu Gly His Glu Asp Cys Val Arg Gly Leu Ala lie Leu Ser 
130 135 140 

Glu Thr Glu Phe Leu Ser Cys Ala Asn Asp Ala Ser lie Arg Arg Trp 
20 145 150 155 160 

Gin lie Thr Gly Glu Cys Leu Glu Val Tyr Phe Gly His Thr Asn Tyr 
165 170 175 

2 5 lie Tyr Ser lie Ser Val Phe Pro Asn Ser Lys Asp Phe Val Thr Thr 

180 185 190 

Ala Glu Asp Arg Ser Leu Arg lie Trp Lys His Gly Glu Cys Ala Gin 
195 200 205 

30 

Thr lie Arg Leu Pro Ala Gin Ser lie Trp Cys Cys Cys Val Leu Glu 
210 215 220 

Asn Gly Asp lie Val Val Gly Ala Ser Asp Gly lie lie Arg Val Phe 
35 225 230 235 240 

Thr Glu Ser Glu Glu Arg Thr Ala Ser Ala Glu Glu lie Lys Ala Ser 

245 250 255 



Pro lie lie Thr Pro Val Arg Arg Thr Leu Pro Cys Arg Val Thr Arg 
275 2R0 2R5 
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290 295 300 

Thr Ser Asp Ser His Leu Thr lie Thr Ala Leu His Leu Phe Leu Thr 

305 310 315 320 

5 

Thr Thr Thr Thr Glu 

325 

(2) INFORMATION FOR SEQ ID NO: 58: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 425 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: unknown 

15 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
20 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN - 

HUMAN, Fig. 41 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Met Ala Asp Lys Glu Ala Ala Phe Asp Asp Ala Val Glu Glu Arg Val 
30 1 5 10 15 

lie Asn Glu Glu Tyr Lys lie Trp Lys Lys Asn Thr Pro Phe Leu Tyr 
20 25 30 

35 Asp Leu Val Met Thr His Ala Leu Glu Trp Pro Ser Leu Thr Ala Gin 

35 40 45 

Trp Leu Pro Asp Val Thr Arg Pro Glu Gly Lys Asp Phe Ser lie His 



6 b 7 0 ' i: o •• • 

He Ala Ser Val Gin Leu Pro Asn Asp Asp Ala Gin Phe Asp Ala Ser 
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His Tyr Asp Ser Glu Lys Gly Glu Phe Qly Qly phe ^ ^ 

100 



10 



20 



25 



35 



105 



110 



Gly Lys II. Glu He Glu lle L ys Ile Agn His Qlu Qly 

±ZU 125 
Arg Ala Arg Tyr Met Pro Gin Asn Pro c ys n e Iie Ala Thr Lys Thr 

130 1 "3 C 

XJb 140 
Pro Ser Ser Asp Val Leu Val Phe Asp Tyr Thr Lys His Pro Ser Lys 

Jo 

Pro Asp Pro Ser Gly Glu Cys Asn Pro Asp Leu Arg Leu Arg Gly His 

Gin Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser Gly His 
180 185 190 

Leu Leu ser Ala Ser Asp Asp His Thr He Cys Leu Trp Asp lie Ser 
95 200 205 

Ala Val Pro Lys Glu Gly Lys Val Val Asp Ala Lys Thr Xle Phe Thr 
210 215 220 

Oly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu 

30 250 255 



Thr Arg Ser Asn Asn Thr Ser Lys Pro Ser Hls Ser V£jl ^ ^ ^ 
260 265 270 

Thr Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu Phe lie 
275 280 285 

Leu Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Trp Asp Leu Arg 

295 300 



_ Jer 

325 370 
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340 345 350 

Glu Gin Ser Pro Glu Asp Ala Glu Asp Gly Pro Pro Glu Leu Leu Phe 
355 360 365 

5 

lie His Gly Gly His Thr Ala Lys He Ser Asp Phe Ser Trp Asn Pro 

370 375 380 

Asn Glu Pro Trp Val He Cys Ser Val Ser Glu Asp Asn He Met Gin 
10 385 390 395 400 

Val Trp Gin Met Ala Glu Asn He Tyr Asn Asp Glu Asp Pro Glu Gly 
405 410 415 

15 Ser Val Asp Pro Glu Gly Gin Gly Ser 

420 425 

(2) INFORMATION FOR SEQ ID NO: 59: 

2 0 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 852 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

2 5 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



30 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: S253 PROTEIN, Fig. 42 



3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Phe Lys Ser Lys Thr Ser Thr Leu Ser Tyr Asp Glu Thr Pro Asn 

15 10 1^ 



Ser Gin Thr Lys His Leu Asn He Pro Gly Asp Arg Ser Arg His Ser 
35 40 4^ 
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50 55 60 

Ser Ala Asp He He Pro Ala Gin Leu Arg Phe He Asp Asn He Asp 
65 70 75 80 

5 

Tyr Gly Thr Arg Leu Arg Lys Thr Leu His Arg Asn Ser Val Val Ser 
85 90 95 

Asn Gly Tyr Asn Lys Leu Ser Glu Asn Asp Arg Trp Tyr Phe Asp Leu 
10 100 105 110 

Phe Asp Arg Lys Tyr Phe Glu Asn Tyr Leu Glu Glu Pro Thr Tyr He 
115 120 125 

15 Lys He Phe Lys Lys Lys Glu Gly Leu Glu Gin Phe Asp Arg Met Phe 

130 135 140 

Leu Ala Gin Glu Leu Lys lie Pro Asp Val Tyr Lys Ser Thr Thr Tyr 
145 150 155 160 

20 

Gin Gly Glu Pro Ala Val Ala Asn Ser Glu Leu Phe Lys Asn Ser He 
165 170 175 

Cys Cys Cys Thr Phe Ser His Asp Gly Lys Tyr Met Val He Gly Cys 
25 180 185 190 

Lys Asp Gly Ser Leu His Leu Trp Lys Val He Asn Ser Pro Val Lys 
195 200 205 

30 Arg Ser Glu Met Gly Arg Ser Glu Lys Ser Val Ser Ala Ser Arg Ala 

210 215 220 

Asn Ser Leu Lys He Gin Arg His Leu Ala Ser He Ser Ser His Asn 
225 230 235 240 



35 



Gly Ser He Ser Ser Asn Asp Leu Lys Pro Ser Asp Gin Phe Glu Gly 

245 250 255 



.k: /Ai'j la^ £ j ne Met u..u His A±a Leu Asp lie Leu Asp Ala Asn Trp 
275 280 285 
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Leu Trp His Pro Glu Arg Lys Tyr Ser Leu Lys Thr Phe Val His Pro 
305 310 315 320 

Asp Phe Val Thr Ser Ala lie Phe Phe Pro Asn Asp Asp Arg Phe lie 

5 325 330 335 

lie Thr Gly Cys Leu Asp His Arg Cys Arg Leu Trp Ser He Leu Asp 
340 345 350 

10 Asn Glu Val Ser Tyr Ala Phe Asp Cys Lys Asp Leu He Thr Ser Leu 

355 360 365 

Thr Leu Ser Pro Pro Gly Gly Glu Tyr Thr He He Gly Thr Phe Asn 
370 375 380 

15 

Gly Tyr He Tyr Val Leu Leu Thr His Gly Leu Lys Phe Val Ser Ser 
385 390 395 400 

Phe His Val Ser Asp Lys Ser Thr Gin Gly Thr Thr Lys Asn Ser Phe 
20 405 410 415 

His Pro Ser Ser Glu Tyr Gly Lys Val Gin His Gly Pro Arg He Thr 
420 425 430 

25 Gly Leu Gin Cys Phe Phe Ser Lys Val Asp Lys Asn Leu Arg Leu He 

435 440 445 

Val Thr Thr Asn Asp Ser Lys He Gin He Phe Asp Leu Asn Glu Lys 
450 455 460 

30 

Lys Pro Leu Glu Leu Phe Lys Gly Phe Gin Ser Gly Ser Ser Arg His 
465 470 475 480 

Arg Gly Gin Phe Leu Met Met Lys Asn Glu Pro Val Val Phe Thr Gly 
35 485 490 495 

Ser Asp Asp His Trp Phe Tyr Thr Trp Lys Met Gin Ser Phe Asn Leu 
500 505 S10 



Gly Ser Met Ser Leu Lys Gly Leu Leu Arg He Val Ser Asn Lys Ser 
530 535 S40 
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545 550 555 560 

Thr Phe Thr Asn Ser Ser Lys Asn Val Leu Gin Thr Gin Thr Val Gly 
565 570 575 

5 

Ser Gin Ala He Lys Asn Asn His Tyr He Ser Phe His Ala His Asn 
580 585 590 

Ser Pro Val Thr Cys Ala Ser He Ala Pro Asp Val Ala He Lys Asn 
10 595 600 605 

Leu Ser Leu Ser Asn Asp Leu lie Phe Glu Leu Thr Ser Gin Tyr Phe 
610 615 620 

15 Lys Glu Met Gly Gin Asn Tyr Ser Glu Ser Lys Glu Thr Cys Asp Asn 

625 630 635 640 

Lys Pro Asn His Pro Val Thr Glu Thr Gly Gly Phe Ser Ser Asn Leu 
645 650 655 

20 

Ser Asn Val Val Asn Asn Val Gly Thr lie Leu He Thr Thr Asp Ser 
660 665 670 

Gin Gly Leu He Arg Val Phe Arg Thr Asp He Leu Pro Glu He Arg 
25 675 680 685 

Lys Lys He lie Glu Lys Phe His Glu Tyr Asn Leu Phe His Leu Glu 
690 . 695 700 

3 0 Ala Ala Gly Lys lie Asn Asn His Asn Asn Asp Ser lie Leu Glu Asn 

705 710 715 720 

Arg Met Asp Glu Arg Ser Ser Thr Glu Asp Asn Glu Phe Ser Thr Thr 
725 730 735 



35 



Pro Pro Ser Asn Thr His Asn Ser Arg Pro Ser His Asp Phe Cys Glu 
740 745 750 



770 



775 



780 
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Pro His Asp lie Pro Arg Val Ser Thr Thr Tyr Pro Lys Leu Lys Cys 
805 810 815 

Asp Val Cys Asn Gly Ser Asn Phe Glu Cys Ala Ser Lys Asn Pro lie 

820 825 830 

Ala Gly Gly Asp Ser Gly Phe Thr Cys Ala Asp Cys Gly Thr lie Leu 
835 840 845 



10 Asn Asn Phe Arg 

850 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

25 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: SOF1 , Fig. 43 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

Met Lys lie Lys Thr lie Lys Arg Ser Ala Asp Asp Tyr Val Pro Val 
15 10 15 

3 5 Lys Ser Thr Gin Glu Ser Gin Met Pro Arg Asn Leu Asn Pro Glu Leu 

20 25 30 

His Pro Phe Glu Arg Ala Arg Glu Tyr Thr T,vr Ala t,ph air, 



Gly His Arg Asp Gly Val Tyr Ala Hp Ala Lvn Ar^. 
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Asn Lys Leu Ala Thr Gly Ser Ala Asp Gly Val He Lys Tyr Trp Asn 
85 90 95 

Met Ser Thr Arg Glu Glu Phe Val Ser Phe Lys Ala His Tyr Gly Leu 
5 100 105 no 

Val Thr Gly Leu Cys Val Thr Gin Pro Arg Phe His Asp Lys Lys Pro 
115 120 125 



10 Asp Leu Lys Ser Gin Asn Phe Met Leu Ser Cys Ser Asp Asp Lys Thr 

130 135 140 

Val Lys Leu Trp Ser He Asn Val Asp Asp Tyr Ser Asn Lys Asn Ser 
145 150 155 160 

15 

Ser Asp Asn Asp Ser Val Thr Asn Glu Glu Gly Leu He Arg Thr Phe 
165 170 175 

Asp Gly Glu Ser Ala Phe Gin Gly He Asp Ser His Arg Glu Asn Ser 
20 180 185 190 

Thr Phe Ala Thr Gly Gly Ala Lys He His Leu Trp Asp Val Asn Arg 
195 200 205 

25 Leu Lys Pro Val Ser Asp Leu Ser Trp Gly Ala Asp Asn He Thr Ser 

210 215 220 

Leu Lys Phe Asn Gin Asn Glu Thr Asp He Leu Ala Ser Thr Gly Ser 

225 230 235 240 

30 

Asp Asn Ser He Val Leu Tyr Asp Leu Arg Thr Asn Ser Pro Thr Gin 
245 250 255 

Lys He Val Gin Thr Met Arg Thr Asn Ala He Cys Trp Asn Pro Met 
35 260 265 270 

Glu Ala Phe Asn Phe Val Thr Ala Asn Glu Asp His Asn Ala Tyr Tyr 
275 280 



Val Ser Ala Val Met Asp Val Asp Phe Ser Pro Thr Gly Asp Glu lie 
305 310 31 ^ 
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325 330 335 

Gly His Ser Arg Glu He Tyr His Thr Lys Arg Met Gin His Val Phe 
340 345 350 

5 

Val Lys Tyr Ser Met Asp Ser Lys Tyr He He Ser Gly Ser Asp Asp 

355 360 365 

Gly Asn Val Arg Leu Trp Arg Ser Lys Ala Trp Glu Arg Ser Asn Val 
10 370 375 380 

Lys Thr Thr Arg Glu Lys Asn Lys Leu Glu Tyr Asp Glu Lys Leu Lys 

385 390 395 400 

15 Glu Arg Phe Arg His Met Pro Glu He Lys Arg He Ser Arg His Arg 

405 410 415 

His Val Pro Gin Val He Lys Lys Ala Gin Glu He Lys Asn He Glu 
420 425 430 

20 

Leu Ser Ser He Lys Arg Arg Glu Ala Asn Glu Arg Arg Thr Arg Lys 
435 440 445 

Asp Met Pro Tyr He Ser Glu Arg Lys Lys Gin He Val Gly Thr Val 
25 450 455 460 

His Lys Tyr Glu Asp Ser Gly Arg Asp Arg Lys Arg Arg Lys Glu Asp 
465 470 475 480 

3 0 Asp Lys Arg Asp Thr Gin Glu Lys 

485 

(2) INFORMATION FOR SEQ ID NO: 61: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(iv) ANTI -SENSE: NO 
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(C) INDIVIDUAL ISOLATE: STE4 - YEAST, Fig. 44 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

5 

Met Ala Ala His Gin Met Asp Ser lie Thr Tyr Ser Asn Asn Val Thr 
15 10 15 

Gin Gin Tyr He Gin Pro Gin Ser Leu Gin Asp He Ser Ala Val Glu 
10 20 25 30 

Asp uiu He Gin Asn Lys He Glu Ala Ala Arg Gin Glu Ser Lys Gin 
35 40 45 

15 Leu His Ala Gin He Asn Lys Ala Lys His Lys He Gin Asp Ala Ser 

50 55 60 

Leu Phe Gin Met Ala Asn Lys Val Thr Ser Leu Thr Lys Asn Lys He 

65 70 75 80 

20 

Asn Leu Lys Pro Asn He Val Leu Lys Gly His Asn Asn Lys lie Ser 
85 90 95 

Asp Phe Arg Trp Ser Arg Asp Ser Lys Arg He Leu Ser Ala Ser Gin 
25 100 105 HO 

Asp Gly Phe Met Leu He Trp Asp Ser Ala Ser Gly Leu Lys Gin Asn 
115 120 125 



30 



35 



Ala He Pro Leu Asp Ser Gin Trp Val Leu Ser Cys Ala He Ser Pro 
130 135 140 

Ser Ser Thr Leu Val Ala Ser Ala Gly Leu Asn Asn Asn Cys Thr He 

145 150 155 160 

Tyr Arg Val Ser Lys Glu Asn Arg Val Ala Gin Asn Val Ala Ser He 

165 170 175 



Leu Thr A^a Ser Gly Asp Met Thr Cys Ala Leu Trp Asp 
195 200 205 
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10 



15 



20 



25 



30 



35 



155 



Val Leu Ala Leu Ala lie Pro Glu Glu Pro Asn Leu Glu 



225 



230 



235 



Asn Ser Ser 
240 



Asn Thr Phe Ala Ser Cys Gly Ser Asp Gly Tyr Thr Tyr lie Trp Asp 
245 2 50 255 

Ser Arg Ser Pro Ser Ala Val Gin Ser Phe Tyr Val Asn Asp Ser Asp 
260 265 270 

He Asn Ala Leu Arg Phe Phe Lys Asp Gly Met Ser lie Val Ala Gly 
275 280 285 

Ser Asp Asn Gly Ala lie Asn Met Tyr Asp Leu Arg Ser Asp Cys Ser 



290 



295 



300 



He Ala Thr Phe Ser Leu Phe Arg Gly Tyr Glu Glu Arg Thr Pro Thr 

305 310 tic 

315 320 



Ser Pro 
335 



Pro Thr Tyr Met Ala Ala Asn Met Glu Tyr Asn Thr Ala Gin 
325 330 

Gin Thr Leu Lys Ser Thr Ser Ser Ser Tyr Leu Asp Asn Gin Gly Val 
340 345 350 

Val Ser Leu Asp Phe Ser Ala Ser Gly Arg Leu Met Tyr Ser Cys Tyr 

355 3 60 365 

Thr Asp lie Gly Cys Val Val Trp Asp Val Leu Lys Gly Glu lie Val 

370 375 38O 

Gly Lys Leu Glu Gly His Gly Gly Arg Val Thr Gly Val Arg Ser Ser 



385 



390 



395 



400 



Pro Asp Gly Leu Ala Val Cys Thr Gly Ser Trp Asp Ser Thr Met 



405 

He Trp Ser Pro Gly Tyr Gin 
420 



Lys 



410 



415 



li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 704 amino acids 
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(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

5 (iv) ANTI- SENSE: NO 

Cvi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TRANSCRIPTION FACTOR TIIF, Fig. 45 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Met Ser Leu Glu Val Ser Asn He Asn Gly Gly Asn Gly Thr Gin Leu 
15 10 15 

15 

Ser His Asp Lys Arg Glu Leu Leu Cys Leu Leu Lys Leu He Lys Lys 
20 25 30 

Tyr Gin Leu Lys Ser Thr Glu Glu Leu Leu Cys Gin Glu Ala Asn Val 
20 35 40 45 

Ser Ser Val Glu Leu Ser Glu He Ser Glu Ser Asp Val Gin Gin Val 
50 55 60 

2 5 Leu Gly Ala Val Leu Gly Ala Gly Asp Ala Asn Arg Glu Arg Lys His 

65 70 75 80 

Val Gin Ser Pro Ala Gin Gly His Lys Gin Ser Ala Val Thr Glu Ala 
85 90 95 

30 

Asn Ala Ala Glu Glu Leu Ala Lys Phe He Asp Asp Asp Ser Phe Asp 
100 105 110 

Ala Gin His Tyr Glu Gin Ala Tyr Lys Glu Leu Arg Thr Phe Val Glu 
35 115 120 125 

Asp Ser Leu Asp He Tyr Lys His Glu Leu Ser Met Val Leu Tyr Pro 
130 135 140 



Lys Ala Lys Glu Phe He Glu Lys Tyr Lys Cys Asp Leu Asp Gly Tyr 

165 1 ™ 
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Tyr lie Glu Gly Leu Phe Asn Leu Leu Leu Leu Ser Lys Pro Glu Glu 
180 165 190 

Leu Leu Glu Asn Asp Leu Val Val Ala Met Glu Gin Asp Lys Phe Val 
5 195 200 205 

lie Arg Met Ser Arg Asp Ser His Ser Leu Phe Lys Arg His lie Gin 
210 215 220 

10 Asp Arg Arg Gin Glu Val Val Ala Asp lie Val Ser Lys Tyr Leu His 

225 230 235 240 

Phe Asp Thr Tyr Glu Gly Met Ala Arg Asn Lys Leu Gin Cys Val Ala 
245 250 255 

15 

Thr Ala Gly Ser His Leu Gly Glu Ala Lys Arg Gin Asp Asn Lys Met 
260 265 270 

Arg Val Tyr Tyr Gly Leu Leu Lys Glu Val Asp Phe Gin Thr Leu Thr 
20 275 280 285 

Thr Pro Ala Pro Ala Pro Glu Glu Glu Asp Asp Asp Pro Asp Ala Pro 

290 295 300 

25 Asp Arg Pro Lys Lys Lys Lys Pro Lys Lys Asp Pro Leu Leu Ser Lys 

305 310 315 320 

Lys Ser Lys Ser Asp Pro Asn Ala Pro Ser lie Asp Arg lie Pro Leu 

325 330 335 

30 

Pro Glu Leu Lys Asp Ser Asp Lys Leu Leu Lys Leu Lys Ala Leu Arg 
340 345 350 

Glu Ala Ser Lys Arg Leu Ala Leu Ser Lys Asp Gin Leu Pro Ser Ala 
35 355 360 365 

Val Phe Tyr Thr Val Leu Asn Ser His Gin Gly Val Thr Cys Ala Glu 
370 375 380 



Val Arg lie Trp Ser Leu Thr Pro Ala Asn Val Arg Thr Leu Lys Asp 
405 410 415 
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420 425 430 

Arg Met Leu Asp Asp Arg Ser Gly Glu Val Thr Arg Ser Leu Met Gly 
435 440 445 

5 

His Thr Gly Pro Val Tyr Arg Cys Ala Phe Ala Pro Glu Met Asn Leu 
450 455 460 

Leu Leu Ser Cys Ser Glu Asp Ser Thr lie Arg Leu Trp Ser Leu Leu 
10 465 470 475 480 

Thr Trp Ser Cys Val Val Thr Tyr Arg Gly His Val Tyr Pro Val Trp 
485 490 495 

15 Asp Val Arg Phe Ala Pro His Gly Tyr Tyr Phe Val Ser Cys Ser Tyr 

500 505 510 

Asp Lys Thr Ala Arg Leu Trp Ala Thr Asp Ser Asn Gin Ala Leu Arg 
515 520 525 

20 

Val Phe Val Gly His Leu Ser Asp Val Asp Cys Val Gin Phe His Pro 
530 535 540 

Asn Ser Asn Tyr Val Ala; Thr Gly Ser Ser Asp Arg Thr Val Arg Leu 
25 545 550 555 560 

Trp Asp Asn Met Thr Gly Gin Ser Val Arg Leu Met Thr Gly His Lys 
565 i 570 575 



30 



35 



Gly Ser Val Ser Ser Leu Ala Phe Ser Ala Cys Gly Arg Tyr Leu Ala 

580 585 590 

Ser Gly Ser Val Asp His Asn lie lie He Trp Asp Leu Ser Asn Gly 
595 600 605 

Ser Leu Val Thr Thr Leu Leu Arg His Thr Ser Thr Val Thr Thr He 
610 615 620 



..a.u., .ni ^eu I'rp Asp Phe His Lys Val Thr Giu Asp Tyr He Ser 
645 650 655 
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Tyr Leu Met Arg Thr Phe Pro Ser Lys Asn Ser Pro Phe Val Ser Leu 
675 680 685 

His Phe Thr Arg Arg Asn Leu Leu Met Cys Val Gly Leu Phe Lys Ser 
690 695 700 



{2) INFORMATION FOR SEQ ID NO: 63: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: unknown 

15 (ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 



20 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1, Fig. 46 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

Met Thr Ala Ser Val Ser Asn Thr Gin Asn Lys Leu Asn Glu Leu Leu 
15 10 15 

3 0 Asp Ala He Arg Gin Glu Phe Leu Gin Val Ser Gin Glu Ala Asn Thr 

20 25 30 

Tyr Arg Leu Gin Asn Gin Lys Asp Tyr Asp Phe Lys Met Asn Gin Gin 
35 40 45 



35 



Leu Ala Glu Met Gin Gin He Arg Asn Thr Val Tyr Glu Leu Glu Leu 
50 55 60 



c.-y Asp His Gin He A^a Ser Leu Thr Val 
85 90 95 
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Gin Gin Gin Gin Gin Gin Leu Ala Ala Ala Ser Ala Ser Val Pro Val 
115 120 125 

Ala Gin Gin Pro Pro Ala Thr Thr Ser Ala Thr Ala Thr Pro Ala Ala 
5 130 135 140 

Asn Thr Thr Thr Gly Ser Pro Ser Ala Phe Pro Val Gin Ala Ser Arg 
145 150 155 160 

10 Pro Asn Leu Val Gly Ser Gin Leu Pro Thr Thr Thr Leu Pro Val Val 

165 170 175 

Ser Ser Asn Ala Gin Gin Gin Leu Pro Gin Gin Gin Leu Gin Gin Gin 
180 185 190 

15 

Gin Leu Gin Gin Gin Gin Pro Pro Pro Gin Val Ser Val Ala Pro Leu 
195 200 205 

Ser Asn Thr Ala lie Asn Gly Ser Pro Thr Ser Lys Glu Thr Thr Thr 
20 210 215 220 

Leu Pro Ser Val Lys Ala Pro Glu Ser Thr Leu Lys Glu Thr Glu Pro 
225 230 235 240 

25 Glu Asn Asn Asn Thr Ser Lys lie Asn Asp Thr Gly Ser Ala Thr Thr 

245 250 255 

Ala Thr Thr Thr Thr Ala Thr Glu Thr Glu lie Lys Pro Lys Glu Glu 

260 265 270 

30 

Asp Ala Thr Pro Ala Ser Leu His Gin Asp His Tyr Leu Val Pro Tyr 
275 280 285 

Asn Gin Arg Ala Asn His Ser Lys Pro lie Pro Pro Phe Leu Leu Asp 
35 290 295 300 

Leu Asp Ser Gin Ser Val Pro Asp Ala Leu Lys Lys Gin Thr Asn Asp 
305 310 315 120 



Leu His Lys Ser Leu Asp His Thr Ser Val Val Cys Cys Val Lys Phe 
340 345 l^o 
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355 360 365 

Val Tyr Arg Val Ser Asp Gly Ser Leu Val Ala Arg Leu Ser Asp Asp 
370 375 380 

5 

Ser Ala Ala Asn Asn His Arg Asn Ser lie Thr Glu Asn Asn Thr Thr 
385 390 395 400 

Thr Ser Thr Asp Asn Asn Thr Met Thr Thr Thr Thr Thr Thr Thr lie 
10 405 410 415 

Thr Thr Thr Ala Met Thr Ser Ala Ala Glu Leu Ala Lys Asp Val Glu 
420 425 430 

15 Asn Leu Asn Thr Ser Ser Ser Pro Ser Ser Asp Leu Tyr lie Arg Ser 

435 440 445 

Val Cys Phe Ser Pro Asp Gly Lys Phe Leu Ala Thr Gly Ala Glu Asp 
450 455 460 

20 

Arg Leu lie Arg lie Trp Asp He Glu Asn Arg Lys He Val Met He 
465 470 475 480 

Leu Gin Gly His Glu Gin Asp He Tyr Ser Leu Asp Tyr Phe Pro Ser 
25 485 490 495 

Gly Asp Lys Leu Val Ser Gly Ser Gly Asp Arg Thr Val Arg He Trp 
500 505 510 

30 Asp Leu Arg Thr Gly Gin Cys Ser Leu Thr Leu Ser He Glu Asp Gly 

515 520 525 

Val Thr Thr Val Ala Val Ser Pro Gly Asp Gly Lys Tyr He Ala Ala 
530 535 540 



35 



Gly Ser Leu Asp Arg Ala Val Arg Val Trp Asp Ser Glu Thr Gly Phe 
545 550 555 560 



580 



W;e Tni Arg Asp 
585 



Gin Ser Val Val 
590 
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Asn Asn Lys Ser Asp Ser Lys Thr Pro Asn Ser Gly Thr Cys Glu Val 
610 615 620 

Thr Tyr lie Gly His Lys Asp Phe Val Leu Ser Val Ala Thr Thr Gin 
5 625 630 635 640 

Asn Asp Glu Tyr lie Leu Ser Gly Ser Lys Asp Arg Gly Val Leu Phe 
645 650 655 

10 Trp Asp Lys Lys Ser Gly Asn Pro Leu Leu Met Leu Gin Gly His Arg 

660 665 670 

Asn Ser Val lie Ser Val Ala Val Ala Asn Gly Ser Ser Leu Gly Pro 
675 680 685 

15 

Glu Tyr Asn Val Phe Ala Thr Gly Ser Gly Asp Cys Lys Ala Arg lie 
690 695 700 

Trp Lys Tyr Lys Lys lie Ala Pro Asn 
20 70§ 710 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 
2 5 (A) LENGTH: 798 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: protein 

30 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

35 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG , Fig. 47 



Pro Gin Pro Val Lys Asn Gin Arg Thr Asn Asn Ala Ala Gly Ala Asn 
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Ser Gly Gin Gin Pro Gin Gin Gin Ser Gin Gly Gin Ser Gin Gin Gin 
35 40 45 

Gly Arg Ser Asn Gly Pro Phe Ser Ala Ser Asp Leu Asn Arg lie Val 
5 50 55 60 

Leu Glu Tyr Leu Asn Lys Lys Gly Tyr His Arg Thr Glu Ala Met Leu 
65 70 75 80 

10 Arg Ala Glu Ser Gly Arg Thr Leu Thr Pro Gin Asn Lys Gin Ser Pro 

85 90 95 

Ala Asn Thr Lys Thr Gly Lys Phe Pro Glu Gin Ser Ser lie Pro Pro 
100 105 110 

15 

Asn Pro Gly Lys Thr Ala Lys Pro lie Ser Asn Pro Thr Asn Leu Ser 
115 120 125 

Ser Lys Arg Asp Ala Glu Gly Gly He Val Ser Ser Gly Arg Leu Glu 
20 130 135 140 

Gly Leu Asn Ala Pro Glu Asn Tyr He Arg Ala Tyr Ser Met Leu Lys 
145 150 155 160 

2 5 Asn Trp Val Asp Ser Ser Leu Glu He Tyr Lys Pro Glu Leu Ser Tyr 

165 170 175 

He Met Tyr Pro He Phe He Tyr Leu Phe Leu Asn Leu Val Ala Lys 

180 185 190 

30 

Asn Pro Val Tyr Ala Arg Arg Phe Phe Asp Arg Phe Ser Pro Asp Phe 
195 200 205 

Lys Asp Phe His Gly Ser Glu He Asn Arg Leu Phe Ser Val Asn Ser 
35 210 215 220 

He Asp His He Lys Glu Asn Glu Val Ala Ser Ala Phe Gin Ser His 
225 230 235 240 



Phe Leu Asn Glu Asn Glu Ser He Gly Gly Ser Leu He He Ser Val 
260 265 270 
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275 280 285 

Glu Lys Leu Ala Asp Gly lie Lys Val Leu Ser Asp Ser Glu Asn Gly 
290 295 300 

5 

Asn Gly Lys Gin Asn Leu Glu Met Asn Ser Val Pro Val Lys Leu Gly 
305 310 315 320 

Pro Phe Pro Lys Asp Glu Glu Phe Val Lys Glu lie Glu Thr Glu Leu 
10 325 330 335 

Lys lie Lys Asp Asp Gin Glu Lys Gin Leu Asn Gin Gin Thr Ala Gly 
340 345 350 

15 Asp Asn Tyr Ser Gly Ala Asn Asn Arg Thr Leu Leu Gin Glu Tyr Lys 

355 360 365 

Ala Met Asn Asn Glu Lys Phe Lys Asp Asn Thr Gly Asp Asp Asp Lys 
370 375 380 

20 

Asp Lys He Lys Asp Lys He Ala Lys Asp Glu Glu Lys Lys Glu Ser 
385 390 395 400 

Glu Leu Lys Val Asp Gly Glu Lys Lys Asp Ser Asn Leu Ser Ser Pro 
25 405 410 415 

Ala Arg Asp He Leu Pro Leu Pro Pro Lys Thr Ala Leu Asp Leu Lys 
420 425 430 

3 0 Leu Glu He Gin Lys Val Lys Glu Ser Arg Asp Ala He Lys Leu Asp 

435 440 445 

Asn Leu Gin Leu Ala Leu Pro Ser Val Cys Met Tyr Thr Phe Gin Asn 
450 455 460 



35 



Thr Asn Lys Asp Met Ser Cys Leu Asp Phe Ser Asp Asp Cys Arg He 
465 470 475 480 



a;i So: ^eu Asm Asr: Pro Asn lie A^a Leu Asn Asn Asn Asp Lys 
500 505 510 
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Tyr Ser Thr Ser Phe Ser Pro Asp Asn Lys Tyr Leu Leu Ser Gly Ser 
530 535 540 

Glu Asp Lys Thr Val Arg Leu Trp Ser Met Asp Thr His Thr Ala Leu 
545 550 555 560 

Val Ser Tyr Lys Gly His Asn His Pro Val Trp Asp Val Ser Phe Ser 
565 570 575 



10 



Pro Leu Gly His Tyr Phe Ala Thr Ala Ser His Asp Gin Thr Ala Arg 
580 585 590 



15 



Leu Trp Ser Cys Asp His lie Tyr Pro Leu Arg lie Phe Ala Gly His 
595 600 605 

Leu Asn Asp Val Asp Cys Val Ser Phe His Pro Asn Gly Cys Tyr Val 
610 615 620 



20 



Phe Thr Gly Ser Ser Asp Lys Thr Cys Arg Met Trp Asp Val Ser Thr 
625 630 635 640 



Gly Asp Ser Val Arg Leu Phe Leu Gly His Thr Ala Pro Val lie Ser 
645 650 655 



25 



lie Ala Val Cys Pro Asp Gly Arg Trp Leu Ser Thr Gly Ser Glu Asp 
660 665 670 



30 



Gly lie lie Asn Val Trp Asp lie Gly Thr Gly Lys Arg Leu Lys Gin 

675 680 685 

Met Arg Gly His Gly Lys Asn Ala lie Tyr Ser Leu Ser Tyr Ser Lys 
690 695 700 



35 



Glu Gly Asn Val Leu lie Ser Gly Gly Ala Asp His Thr Val Arg Val 
705 710 715 720 



Trp Asp Leu Lys Lys Ala Thr Thr Glu Pro Ser Ala Glu Pro Asp Glu 
725 730 7^ 



He Lys Glu Tyr Gly Arg Arg Arg Thr Val He Pro Thr Ser Asp Leu 
755 760 76S 
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770 775 780 

Ser Arg Ser Asn Leu Ala Leu Ala Gly Gly Ala Phe Arg Pro 
785 790 795 

(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 439 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

15 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
20 (C) INDIVIDUAL ISOLATE: YCU7 , Fig. 48 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

2 5 Met Val Arg Arg Phe Arg Gly Lys Glu Leu Ala Ala Thr Thr Phe Asn 

15 10 15 

Gly His Arg Asp Tyr Val Met Gly Ala Phe Phe Ser His Asp Gin Glu 

20 25 30 

30 

Lys He Tyr Thr Val Ser Lys Asp Gly Ala Val Phe Val Trp Glu Phe 
35 40 45 

Thr Lys Arg Pro Ser Asp Asp Asp Asp Asn Glu Ser Glu Asp Asp Asp 
35 50 55 60 

Lys Gin Glu Glu Val Asp He Ser Lys Tyr Ser Trp Arg He Thr Lys 
65 70 75 80 



His Pro Ala Thr Arg Leu Leu Ala Val Gly Phe Thr Ser Gly Glu Phe 
100 105 110 
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115 120 125 

Gly Gin Asn Pro Val Asn Thr Val Ser Val Asn Gin Thr Gly Glu Trp 
130 135 140 

5 

Leu Ala Phe Gly Ser Ser Lys Leu Gly Gin Leu Leu Val Tyr Glu Trp 
145 150 155 160 

Gin Ser Glu Ser Tyr lie Leu Lys Gin Gin Gly His Phe Asp Ser Thr 
10 165 170 175 

Asn Ser Leu Ala Tyr Ser Pro Asp Gly Ser Arg Val Val Thr Ala Ser 
180 185 190 

15 Glu Asp Gly Lys lie Lys Val Trp Asp lie Thr Ser Gly Phe Cys Leu 

195 200 205 

Ala Thr Phe Glu Glu His Thr Ser Ser Val Thr Ala Val Gin Phe Ala 

210 215 220 

20 

Lys Arg Gly Gin Val Met Phe Ser Ser Ser Leu Asp Gly Thr Val Arg 
225 230 235 240 

Ala Trp Asp Leu lie Arg Tyr Arg Asn Phe Arg Thr Phe Thr Gly Thr 
25 245 250 255 

Glu Arg lie Gin Phe Asn Cys Leu Ala Val Asp Pro Ser Gly Glu Val 
260 265 270 

3 0 Val Cys Ala Gly Ser Leu Asp Asn Phe Asp lie His Val Trp Ser Val 

275 280 285 

Gin Thr Gly Gin Leu Leu Asp Ala Leu Ser Gly His Glu Gly Pro Val 
290 295 300 



35 



Ser Cys Leu Ser Phe Ser Gin Glu Asn Ser Val Leu Ala Ser Ala Ser 
305 310 315 320 



xie Giu Vai Tyr Ser Asp Val Leu Ala Leu Ser Met: Arg 
340 345 350 
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He Phe Asn lie Glu Asp Ala Lys Gin Val Gly Asn lie Asp Cys Arg 



3 ?0 375 



380 



10 



15 



30 



35 



Lys Asp He lie Ser Gly Arg Phe Asn Gin Asp Arg Phe Thr Ala Lys 

385 395 ,„« 

jyi > 400 

He Leu Asn Asp Pro Asn Phe Leu Leu Gin Tyr lie Thr Val Leu Met 
405 410 415 

Val Trp Leu Leu Trp Leu Val Val lie lie Thr Pro Phe Val Tyr Met 



420 4 25 



430 



Met Phe Gin Met Lys Ser Cys 
435 

(2 J INFORMATION FOR SEQ ID NO: 66: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

25 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 



(vi) ORIGINAL SOURCE : 

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN, Fig. 4 9 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 66: 

Met ser Thr Leu lie Pro Pro Pro Ser Lys Lys Gin Lys Lys Glu Ala 
1 



Gin Leu Pro Arg Glu Val Ala lie lie Pro TlV8 Asn i.r 



40 



45 

Leu Arg Val Pro Gly Ala lie Ser Glu Lys Gin Leu Glu Oh: t_ - 



WO 95/21252 



PCT/US95/01210 



- 169 - 

Asn Gin Leu Asn Gly Thr Ser Asp Asp Pro Val Pro Tyr Thr Phe Ser 
65 70 75 80 

Cys Thr lie Gin Gly Lys Lys Ala Ser Asp Pro Val Lys Thr lie Asp 
5 85 90 95 

lie Thr Asp Asn Leu Tyr Ser Ser Leu lie Lys Pro Gly Tyr Asn Ser 
100 105 110 

10 Thr Glu Asp Gin lie Thr Leu Leu Tyr Thr Pro Arg Ala Val Phe Lys 

115 120 125 

Val Lys Pro Val Thr Arg Ser Ser Ser Ala He Ala Gly His Gly Ser 
130 135 140 

15 

Thr He Leu Cys Ser Ala Phe Ala Pro His Thr Ser Ser Arg Met Val 
145 150 155 160 

Thr Gly Ala Gly Asp Asn Thr Ala Arg He Trp Asp Cys Asp Thr Gin 
20 165 170 175 

Thr Pro Met His Thr Leu Lys Gly His Tyr Asn Trp Val Leu Cys Val 
180 185 190 

25 Ser Trp Ser Pro Asp Gly Glu Val He Ala Thr Gly Ser Met Asp Asn 

195 200 205 

Thr lie Arg Leu Trp Asp Pro Lys Ser Gly Gin Cys Leu Gly Asp Ala 

210 215 220 

30 

Leu Arg Gly His Ser Lys Trp He Thr Ser Leu Ser Trp Glu Pro He 
225 230 235 240 

His Leu Val Lys Pro Gly Ser Lys Pro Arg Leu Ala Ser Ser Ser Lys 
35 245 250 255 

Asp Gly Thr He Lys He Trp Asp Thr Val Ser Arg Val Cys Gin Tyr 
260 265 270 



Gin Gly Leu Leu Tyr Ser Gly Ser His Asp Arg Thr Val Arg Val Trp 
290 295 300 
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305 310 315 320 

His Trp Val Asn His Leu Ser Leu Ser Thr Asp Tyr Ala Leu Arg lie 
325 330 335 

5 

Gly Ala Phe Asp His Thr Gly Lys Lys Pro Ser Thr Pro Glu Glu Ala 
340 345 350 

Gin Lys Lys. Ala Leu Glu Asn Tyr Glu Lys lie Cys Lys Lys Asn Gly 
10 355 360 365 

Asn ser Glu Glu Met Met Val Thr Ala Ser Asp Asp Tyr Thr Met Phe 
370 375 380 

15 Leu Trp Asn Pro Leu Lys Ser Thr Lys Pro lie Ala Arg Met Thr Gly 

385 390 395 400 

His Gin Lys Leu Val Asn His Val Ala Phe Ser Pro Asp Gly Arg Tyr 
405 410 415 

20 

lie Val Ser Ala Ser Phe Asp Asn Ser lie Lys Leu Trp Asp Gly Arg 
420 425 430 

Asp Gly Lys Phe lie Ser Thr Phe Arg Gly His lie Ala Ser Val Tyr 
25 435 440 445 

Gin Val Ala Trp Ser Ser Asp Cys Arg Leu Leu Val Ser Cys Ser Lys 
450 455 460 

3 0 Asp Thr Thr Leu Lys Val Trp Asp Val Arg Thr Arg Lys Leu Ser Val 

465 470 475 480 

Asp Leu Pro Gly lie Lys Thr Lys Leu Tyr Val Asp Trp Ser Val Asp 
485 490 495 



35 



Gly Lys Arg Val Cys Ser Gly Gly Lys Asp Lys Met Val Arg Leu Trp 

500 505 510 



N KOkMA I ION rOK SEg ±L- NO : 6 J . 
(i) SEQUENCE CHARACTERISTICS 
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(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

5 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
10 (C) INDIVIDUAL ISOLATE: YKL525, Fig. 50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

15 Met Phe Lys Ser Lys Thr Ser Thr Leu Ser Tyr Asp Glu Thr Pro Asn 

15 10 15 

Ser Asn Glu Gly Asp Arg Asn Ala Thr Pro Val Asn Pro Lys Glu Lys 
20 25 30 

20 

Ser Gin Thr Lys His Leu Asn lie Pro Gly Asp Arg Ser Arg His Ser 
35 40 45 

Ser lie Ala Asp Ser Lys Arg Ser Ser Ser Arg Tyr Asp Gly Gly Tyr 
25 50 55 60 

Ser Ala Asp lie lie Pro Ala Gin Leu Arg Phe lie Asp Asn lie Asp 
65 70 75 80 

3 0 Tyr Gly Thr Arg Leu Arg Lys Thr Leu His Arg Asn Ser Val Val Ser 

85 90 95 

Asn Gly Tyr Asn Lys Leu Ser Glu Asn Asp Arg Trp Tyr Phe Asp Leu 
100 105 110 



35 



Phe Asp Arg Lys Tyr Phe Glu Asn Tyr Leu Glu Glu Pro Thr Tyr lie 
115 120 125 



145 



^y^ .it: Asp 7a i Tyr Lys Ser Thr Thr Tyr 

150 155 160 
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Cys Cys Cys Thr Phe Ser His Asp Gly Lys Tyr Met Val lie Gly Cys 

180 185 190 

Lys Asp Gly Ser Leu His Leu Trp Lys Val lie Asn Ser Pro Val Lys 

195 200 205 

Arg Ser Glu Met Gly Arg Ser Glu Lys Ser Val Ser Ala Ser Arg Ala 

210 215 220 



10 



Asn Ser Leu Lys lie Gin Arg His Leu Ala Ser He Ser Ser His Asn 
225 230 235 240 



15 



Gly Ser He Ser Ser Asn Asp Leu Lys Pro Ser Asp Gin Phe Glu Gly 

245 250 255 

Pro Ser Lys Gin Leu His Leu Tyr Ala Pro Val Phe Tyr Ser Asp Val 

260 265 270 



20 



Phe Arg Val Phe Met Glu His Ala Leu Asp He Leu Asp Ala Asn Trp 
275 280 285 



Ser Lys Asn Gly Phe Leu He Thr Ala Ser Met Asp Lys Thr Ala Lys 
290 295 300 



25 



Leu Trp His Pro Glu Arg Lys Tyr Ser Leu Lys Thr Phe Val His Pro 
305 310 315 320 



30 



Asp Phe Val Thr Ser Ala He Phe Phe Pro Asn Asp Asp Arg Phe He 

325 330 335 

lie Thr Gly Cys Leu Asp His Arg Cys Arg Leu Trp Ser He Leu Asp 
340 345 350 



35 



Asn Glu Val Ser Tyr Ala Phe Asp Cys Lys Asp Leu He Thr Ser Leu 
355 360 365 



Thr Leu Ser Pro Pro Gly Gly Glu Tyr Thr He He Gly Thr Phe Asn 
370 375 ™~ 



Phe His Val Ser Asp Lys Ser Thr Gin Gly Thr Thr Lys Asn Ser Phe 
4 05 41^ 
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420 425 430 

Gly Leu Gin Cys Phe Phe Ser Lys Val Asp Lys Asn Leu Arg Leu lie 
435 440 445 

5 

Val Thr Thr Asn Asp Ser Lys lie Gin lie Phe Asp Leu Asn Glu Lys 
450 455 460 

Lys Pro Leu Glu Leu Phe Lys Gly Phe Gin Ser Gly Ser Ser Arg His 
10 465 470 475 480 

Arg Gly Gin Phe Leu Met Met Lys Asn Glu Pro Val Val Phe Thr Gly 
485 490 495 

15 Ser Asp Asp His Trp Phe Tyr Thr Trp Lys Met Gin Ser Phe Asn Leu 

500 505 510 

Ser Ala Glu Met Asn Cys Thr Ala Pro His Arg Lys Lys Arg Leu Ser 
515 520 525 

20 

Gly Ser Met Ser Leu Lys Gly Leu Leu Arg lie Val Ser Asn Lys Ser 
530 535 540 

Thr Asn Asp Glu Cys Leu Thr Glu Thr Ser Asn Gin Ser Ser Ser His 
25 545 550 555 560 

Thr Phe Thr Asn Ser Ser Lys Asn Val Leu Gin Thr Gin Thr Val Gly 
565 570 575 

30 Ser Gin Ala lie Lys Asn Asn His Tyr lie Ser Phe His Ala His Asn 

580 585 590 

Ser Pro Val Thr Cys Ala Ser lie Ala Pro Asp Val Ala lie Lys Asn 
595 600 605 



35 



Leu Ser Leu Ser Asn Asp Leu lie Phe Glu Leu Thr Ser Gin Tyr Phe 

610 615 620 



rtiii. i-rc Vai Thr G i u 'I'hr Gly Gly Phe Ser Ser Asn Leu 

645 650 655 
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Gin Gly Leu He Arg Val Phe Arg Thr Asp He Leu Pro Glu He Arg 
675 680 685 

Lys Lys He He Glu Lys Phe His Glu Tyr Asn Leu Phe His Leu Glu 
5 690 695 700 

Ala Ala Gly Lys He Asn Asn His Asn Asn Asp Ser He Leu Glu Asn 
705 710 715 720 

10 Arg Met Asp Glu Arg Ser Ser Thr Glu Asp Asn Glu Phe Ser Thr Thr 

725 730 735 

Pro Pro Ser Asn Thr His Asn Ser Arg Pro Ser His Asp Phe Cys Glu 
740 745 750 

15 

Leu His Pro Asn Asn Ser Pro Val He Ser Gly Met Pro Ser Arg Ala 
755 760 765 

Ser Ala He Phe Lys Asn Ser He Phe Asn Lys Ser Asn Gly Ser Phe 
20 770 775 780 

He Ser Leu Lys Ser Arg Ser Glu Ser Thr Ser Ser Thr Val Phe Gly 
785 790 795 800 

25 Pro His Asp He Pro Arg Val Ser Thr Thr Tyr Pro Lys Leu Lys Cys 

805 810 815 

Asp Val Cys Asn Gly Ser Asn Phe Glu Cys Ala Ser Lys Asn Pro He 

820 825 830 

30 

Ala Gly Gly Asp Ser Gly Phe Thr Cys Ala Asp Cys Gly Thr He Leu 
835 840 845 

Asn Asn Phe Arg 
35 850 

(2) INFORMATION FOR SEQ ID NO: 68: 



(ii) MOLECULE TYPE: protein 

4^ 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: yrb 1410 yeast, Fig. 51 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 

Met Ser Gin Lys Gin Ser Thr Asn Gin Asn Gin Asn Gly Thr His Gin 
10 1 5 10 15 

Pro Gin Pro Val Lys Asn Gin Arg Thr Asn Asn Ala Ala Gly Ala Asn 
20 25 30 

15 Ser Gly Gin Gin Pro Gin Gin Gin Ser Gin Gly Gin Ser Gin Gin Gin 

35 40 45 

Gly Arg Ser Asn Gly Pro Phe Ser Ala Ser Asp Leu Asn Arg lie Val 
50 55 60 

20 

Leu Glu Tyr Leu Asn Lys Lys Gly Tyr His Arg Thr Glu Ala Met Leu 
65 70 75 80 

Arg Ala Glu Ser Gly Arg Thr Leu Thr Pro Gin Asn Lys Gin Ser Pro 
25 85 90 95 

Ala Asn Thr Lys Thr Gly Lys Phe Pro Glu Gin Ser Ser lie Pro Pro 
100 105 110 

30 Asn Pro Gly Lys Thr Ala Lys Pro He Ser Asn Pro Thr Asn Leu Ser 

115 120 125 

Ser Lys Arg Asp Ala Glu Gly Gly He Val Ser Ser Gly Arg Leu Glu 
130 135 140 



35 



Gly Leu Asn Ala Pro Glu Asn Tyr He Arg Ala Tyr Ser Met Leu Lys 

145 150 155 160 



ryr Pro x^e Phe lie Tyr Leu Phe Leu Asn Leu Val Ala Lys 
180 185 190 
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Lys Asp Phe His Gly Ser Glu He Asn Arg Leu Phe Ser Val Asn Ser 
210 215 220 

He Asp His He Lys Glu Asn Glu Val Ala Ser Ala Phe Gin Ser His 
5 225 230 235 240 

Lys Tyr Arg lie Thr Met Ser Lys Thr Thr Leu Asn Leu Leu Leu Tyr 
245 250 255 

10 Phe Leu Asn Glu Asn Glu Ser He Gly Gly Ser Leu He lie Ser Val 

260 265 270 

lie Asn Gin His Leu Asp Pro Asn lie Val Glu Ser Val Thr Ala Arg 
275 280 285 

15 

Glu Lys Leu Ala Asp Gly lie Lys Val Leu Ser Asp Ser Glu Asn Gly 
290 295 300 

Asn Gly Lys Gin Asn Leu Glu Met Asn Ser Val Pro Val Lys Leu Gly 
20 305 310 315 320 

Pro Phe Pro Lys Asp Glu Glu Phe Val Lys Glu lie Glu Thr Glu Leu 
325 330 335 

2 5 Lys lie Lys Asp Asp Gin Glu Lys Gin Leu Asn Gin Gin Thr Ala Gly 

340 345 350 

Asp Asn Tyr Ser Gly Ala Asn Asn Arg Thr Leu Leu Gin Glu Tyr Lys 
355 360 365 

30 

Ala Met Asn Asn Glu Lys Phe Lys Asp Asn Thr Gly Asp Asp Asp Lys 
370 375 380 

Asp Lys lie Lys Asp Lys lie Ala Lys Asp Glu Glu Lys Lys Glu Ser 
35 385 390 395 400 

Glu Leu Lys Val Asp Gly Glu Lys Lys Asp Ser Asn Leu Ser Ser Pro 
405 410 415 



Leu Glu He Gin Lys Val Lys Glu Ser Arg Asp Ala lie Lys Leu Asp 
435 440 445 
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450 455 460 

Thr Asn Lys Asp Met Ser Cys Leu Asp Phe Ser Asp Asp Cys Arg lie 
465 470 475 480 

5 

Ala Ala Ala Gly Phe Gin Asp Ser Tyr lie Lys lie Trp Ser Leu Asp 
485 490 495 

Gly Ser Ser Leu Asn Asn Pro Asn lie Ala Leu Asn Asn Asn Asp Lys 
10 500 505 510 

Asp Glu Aap Pro Thr cys Lys Thr Leu Val Gly His Ser Gly Thr Val 
515 520 525 

15 Tyr Ser Thr Ser Phe Ser Pro Asp Asn Lys Tyr Leu Leu Ser Gly Ser 

530 535 540 

Glu Asp Lys Thr Val Arg Leu Trp Ser Met Asp Thr His Thr Ala Leu 
545 550 555 560 

20 

Val Ser Tyr Lys Gly His Asn His Pro Val Trp Asp Val Ser Phe Ser 
565 570 575 

Pro Leu Gly His Tyr Phe Ala Thr Ala Ser His Asp Gin Thr Ala Arg 
25 580 585 590 

Leu Trp Ser Cys Asp His lie Tyr Pro Leu Arg He Phe Ala Gly His 
595 600 605 

30 Leu Asn Asp Val Asp Cys Val Ser Phe His Pro Asn Gly Cys Tyr Val 

610 615 620 

Phe Thr Gly Ser Ser Asp Lys Thr Cys Arg Met Trp Asp Val Ser Thr 
625 630 635 640 



35 



Gly Asp Ser Val Arg Leu Phe Leu Gly His Thr Ala Pro Val He Ser 
645 650 655 



675 



680 



685 
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Glu Gly Asn Val Leu He Ser Gly Gly Ala Asp His Thr Val Arg Val 
705 710 715 720 

Trp Asp Leu Lys Lys Ala Thr Thr Glu Pro Ser Ala Glu Pro Asp Glu 
5 725 730 735 

Pro Phe He Gly Tyr Leu Gly Asp Val Thr Ala Ser He Asn Gin Asp 
740 745 750 

10 He Lys Glu Tyr Gly Arg Arg Arg Thr Val He Pro Thr Ser Asp Leu 

755 760 765 

Val Ala Ser Phe Tyr Thr Lys Lys Thr Pro Val Phe Lys Val Lys Phe 
770 775 780 



15 



Ser Arg Ser Asn Leu Ala Leu Ala Gly Gly Ala Phe Arg Pro 
785 790 795 



20 (2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
25 (D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



30 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 protein rl, Fig. 1C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



20 



/v^a ot'i Asp Lys 'In r lie lie Met Trp Lys 

25 30 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI-SENSE: NO 

(vi) CRIMINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 protein rll, Fig. 1C 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Gly His Ser His Phe Val Ser Asp Val Val He Ser Ser Asp Gly Gin 

15 10 15 

20 

Phe Ala Leu Ser Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 71: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

30 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 protein rill , Fig. 1C 



Gly His Thr Lys Asp Val Leu Ser Val Ala Phe Ser Ser Asp Asn Arg 
15 10 15 
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20 25 30 



(2) INFORMATION FOR SEQ ID NO: 72: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 (ii) MOLECULE TYPE: peptide 

Uii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

15 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 protein rIV, Fig. 1C 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser Ser 
1 5 10 15 

25 Asn Pro lie lie Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val Trp 

20 ' 25 30 

Asn 



(2) INFORMATION FOR SEQ ID NO:73: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



/vNTi - SENSE ; 
(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gly His Thr Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser 
15 10 15 

5 

Leu Cys Ala Ser Gly Gly Lys Asp Gly Gin Ala Met Leu Trp Asp 
20 25 30 



10 



15 



25 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

2 0 (iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 protein rVI , Fig. 1C 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Leu Asp Gly Gly Asp lie lie Asn Ala Leu Cys Phe Ser Pro Asn Arg 

15 10 15 

Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser lie Lys lie Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RACK1 protein rVII, Fig. 1C 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ser Lys Ala Glu Pro Pro Gin Cys Thr Ser Leu Ala Trp Ser Ala Asp 
15 10 15 

10 Gly Gin Thr Leu Phe Ala Gly Tyr Thr Asp Asn Leu Val Arg Val Trp 

20 25 30 

Gin 



15 



(2) INFORMATION FOR SEQ ID NO: 76: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

25 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (C) INDIVIDUAL ISOLATE: Human 55 kDa protein rl , Fig. 11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

35 Gly His Thr Asp Ala Val Leu Asp Leu Ser Trp Asn Lys Leu lie Arg 

15 10 15 

Asn Val Leu Ala Ser Ala Ser Ala Asp Asn Thr Val lie Leu Trp Asv 



(i) SEQUENCE CHARACTERISTICS 
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(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

5 (iii) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
10 (C) INDIVIDUAL ISOLATE: Human 55 kDa protein rll, Fig. 11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

15 Ala His Asn Asp Glu lie Ser Gly Leu Asp Leu Ser Ser Gin lie Lys 

15 10 15 

Gly Cys Leu Val Thr Ala Ser Ala Asp Lys Tyr Val Lys lie Trp Asp 
20 25 30 

20 

(2) INFORMATION FOR SEQ ID NO:78: 

<i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

30 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Human 5 5 kDa protein rill, Fig. 11 



Cys Pro Asp Leu Pro Phe lie Tyr Ala Phe Gly Gly Gin Lys Glu Gly 
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Leu Arg Val Trp Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 79: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: AAC-RICH protein rl , Fig. 12 



20 



25 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Gly Asn Lys Lys Lys Ser Thr Ser Val Ala Trp Asn Ala Asn Gly Thr 
15 10 15 

Lys He Ala Ser Ser Gly Ser Asp Gly He Val Arg Val Trp Asn 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

35 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



'RIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: AAC-RICH protein rll, Fig. 12 
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Gly His Asp Gly Ser lie Glu Lys He Ser Trp Ser Pro Lys Asn Asn 
15 10 15 

Asp Leu Leu Ala Ser Ala Gly Thr Asp Lys Val He Lys He Trp Asp 

5 20 25 30 



(2) INFORMATION FOR SEQ ID NO: 81: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

15 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 



20 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: AAC-RICH protein rill, Fig. 12 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 

Asp His Leu Ala Leu He Asp Leu Pro Thr He Lys Thr Leu Lys He 

15 10 15 

30 Tyr Lys Phe Asn Gly Glu Glu Leu Asn Gin Val Gly Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 82: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



nil) 
(iv) 



HYPOTHETICAL : NO 

ANTT- SENSE: NH 
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(C) INDIVIDUAL ISOLATE: AAC-RICH protein rIV, Fig. 12 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

5 

Gly His Thr Ala Ser lie Tyr Cys Met Glu Phe Asp Pro Thr Gly Lys 
15 10 15 

Tyr Leu Ala Ala Gly Ser Ala Asp Ser lie Val Ser Leu Trp Asp 
10 20 25 30 

(2) INFORMATION FOR SEQ 1JJ NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

20 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

25 <vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: BETA TRCP rl , Fig. 13 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

30 

lie His Cys Arg Ser Glu Thr Ser Lys Gly Val Tyr Cys Leu Gin Tyr 

15 10 15 

Asp Asp Gin Lys lie Val Ser Gly Leu Arg Asp Asn Thr lie Lys lie 
35 20 25 30 

Trp Asp 



z. v U CiN C MAKA L ' i E K 1 b T I - . 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

5 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: BETA TRCP rll, Fig. 13 



10 



15 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly His Thr Gly Ser Val Leu Cys Leu Gin Tyr Asp Glu Arg Val II 
15 10 15 

He Thr Gly Ser Asp Ser Thr Val Arg Val Trp Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: BETA TRCP rill, Fig. 13 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

He His His Cys Glu Ala Val Leu His Leu Arq Php Asn A^r mo 

20 25 30 



(2) INFORMATION FOR SEQ ID NO : 8 6 
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(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: BETA TRCP rIV, Fig. 13 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 

Gly His Arg Ala Ala Val Asn Val Val Asp Phe Asp Asp Lys Tyr lie 
15 10 15 

2 0 Val Ser Ala Ser Gly Asp Arg Thr lie Lys Val Trp Asn 

20 25 

(2) INFORMATION FOR SEQ ID NO: 87: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

3 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: BETA TRCP rV, Fig. 13 



■.j *//t: Arg I.e Aj.a Jys Leu Gin Tyr Arg Asp Arg Leu Val 

15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: BETA TRCP rVI, Fig. 13 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

20 Gly His Glu Glu Leu Val Arg Cys lie Arg Phe Asp Asn Lys Arg lie 

15 10 15 

Val Ser Gly Ala Tyr Asp Gly Lys lie Lys Val Trp Asp 
20 25 



25 



(2) INFORMATION FOR SEQ ID NO: 89: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

35 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 9 
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Val Ser Ser Ser His Asp Asp Thr lie Leu lie Trp Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO: 90: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
15 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: beta-prime-cop rl, Fig. 14 



20 



25 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Ala His Ser Asp Tyr lie Arg Cys lie Ala Val His Pro Thr Gin Pro 
15 10 15 

Phe lie Leu Thr Ser Ser Asp Asp Met Leu lie Lys Leu Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



MUlNAi, SOURCE. 

(C) INDIVIDUAL ISOLATE: beta-prime- cop rll, Fig. 14 
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Gly His Thr His Tyr Val Met Gin He Val He Asn Pro Lys Asp Asn 
15 10 15 

Asn Gin Phe Ala Ser Ala Ser Leu Asp Arg Thr He Lys Val Trp Gin 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 92: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

15 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 



20 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: beta-prime-cop rill, Fig. 14 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Gly His Glu Lys Gly Val Asn Cys He Asp Tyr Tyr Ser Gly Gly Asp 

15 10 15 



30 



35 



Lys Pro Tyr Leu He Ser Gly Ala Asp Asp Arg Leu Val Lys He Trp 
20 25 30 



Asp 



(2) INFORMATION FOR SEQ ID NO: 93: 
(i) SEQUENCE CHARACTERISTICS: 



(ii) MOLECULE TYPE : peptide 



10 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: beta-prime-cop rIV, Fig. 14 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Gly His Ala Gin Asn Val Ser Cys Ala Ser Phe His Pro Glu Leu Pro 
15 10 15 

lie lie lie Thr Gly Ser Glu Asp Gly Thr Val Arg lie Trp His 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

20 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
25 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rl , Fig. 15 



15 



30 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Gly His Met Thr Ser Val He Thr Cys Leu Gin Phe Glu Asp Asn Tyr 
1 5 10 15 

Val He Thr Gly Ala Asp Asp Lys Met He Arg Val Tyr Asp 

20 25 30 



.Ay LENGTH : 3 0 ammo acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO 



5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rll, Fig. 15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

10 

Gly His Asp Gly Gly Val Trp Ala Leu Lys Tyr Ala His Gly Gly Tie 
x 5 10 15 

Leu Val Ser Gly Ser Thr Asp Arg Thr Val Arg Val Trp Asp 
15 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

2 0 (A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE :. peptide 

25 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

3 0 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rill, Fig. 15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

35 

Gly His Asn Ser Thr Val Arg Cys Leu Asp lie Val Glu Tyr Lys Asn 
15 10 15 
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5 



15 



20 



25 



30 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE : peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI-SENSE; NO 



(C) INDIVIDUAL ISOLATE: CDC4 / CDC2 0 protein rlV, Fig. 15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Gly His Met Ala Ser Val Arg Thr Val Ser Gly His Gly Asn lie Val 
15 10 15 

Val Ser Gly Ser Tyr Asp Asn Thr Leu lie Val Trp Asp 
20 25 

(2) INFORMATION FOR SEQ ID NQ:98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CDC 4 / CDC20 protein rV, Fig. 15 



Gly His Thr Asp Arg lie Tyr Ser Thr lie Tyr Asp His Glu Arg Lys 
15 10 15 
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20 25 30 

(2) INFORMATION FOR SEQ ID NO: 99: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



15 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CDC4 / CDC20 protein rVI , Fig. 15 



2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Gly His Thr Ala Leu Val Gly Leu Leu Arg Leu Ser Asp Lys Phe Leu 
15 10 15 

25 Val Ser Ala Ala- Ala Asp Gly Ser lie Arg Gly Trp Asp 

20 25 

(2) INFORMATION FOR SEQ ID NO: 100: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

35 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



i* I G INAi. . ■ .URCh 

(C) INDIVIDUAL ISOLATE: GBLF - CHLAMIDOMONAS HOMOLOG rl, Fig 
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35 
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Gly His Thr Asn Trp Val Thr Ala lie Ala Thr Pro Leu Asp Pro Ser 
15 10 15 

Ser Asn Thr Leu Leu Ser Ala Ser Arg Asp Lys Ser Val Leu Val Trp 
20 25 30 



Glu 



10 (2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBLP - CHLAM I DOMONAS HOMOLOG rll, Fig. 



25 16 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

30 Gly His Ser His Phe Val Gin Asp Val Val lie Ser Ser Asp Gly Gin 

15 10 15 

Phe Cys Leu Thr Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 102: 
(i) SEQUENCE CHARACTERISTICS: 



(ii) MOLECULE TYPE: peptide 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBLP - CHLAMIDOMONAS HOMOLOG rill, Fig. 

5 16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

Gly His Thr Lys Asp Val Leu Ser Val Ala Phe Ser Val Asp Asn Arg 
10 1 5 10 15 

Gin lie Val Ser Gly Ser Arg Asp Lys Thr lie Lys Leu Trp Asn 
20 25 30 

15 (2) INFORMATION FOR SEQ ID NO:103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
2 0 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



25 



30 16 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBLP -CHLAMIDOMONAS HOMOLOG rIV, Fig. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Gly His Thr Glu Trp Val Ser Cys Val Arg Phe Ser Pro Met Thr Thr 
35 1 5 10 15 

Asn Pro He He Val Ser Gly Gly Trp Asp Lys Met Val Lys Val Trp 
20 25 ~i o 



(2) INFORMATION FOR SEQ ID NO: 104 
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(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

5 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

10 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBLP -CHLAMIDOMONAS HOMOLOG rV, Fig. 

16 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

Gly His His Gly Tyr Val Asn Thr Val Thr Val Ser Pro Asp Gly Ser 
15 10 15 

20 Leu Cys Ala Ser Gly Gly Lys Asp Gly lie Ala Met Leu Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 105: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

3 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBLP -CHLAMIDOMONAS HOMOLOG rVI , Fig. 



16 



lie His Cys Leu Cys Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala 
15 10 15 



4^ 
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5 



10 



20 
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20 25 30 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

15 (iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GBLP -CHLAMIDOMONAS HOMOLOG rVII, Fig. 



16 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:106: 

Lys Lys Ala Gin Val Pro Tyr Cys Val Ser Leu Ala Trp Ser Ala Asp 
25 1 5 10 15 

Gly Ser Thr Leu Tyr Ser Gly Tyr Thr Asp Gly Gin lie Arg Val Trp 
20 25 30 

30 Ala 



(2) INFORMATION FOR SEQ ID NO: 107: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



_ di'POTHETICAL ; NO 
(iv) ANTI-SENSE: NO 
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(C) INDIVIDUAL ISOLATE: cop-1 protein rl, Fig. 17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 

5 

Met Ser Thr Arg Ser Lys Leu Ser Cys Leu Ser Trp Asn Lys His Glu 
15 10 15 

Lys Asn His lie Ala Ser Ser Asp Tyr Glu Gly lie Val Thr Val Trp 
10 20 25 30 



15 (2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
{B) TYPE: amino acid 
2 0 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

25 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: cop-1 protein rll, Fig. 17 

30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 

Glu Lys Arg Ala Trp Ser Val Asp Phe Ser Arg Thr Glu Pro Ser Met 
35 1 5 10 15 

Leu Val Ser Gly Ser Asp Asp Cys Lys Val Lys Val Trp Cys 
20 25 30 



(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

5 (iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: cop-1 protein rill, Fig. 17 



10 



15 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

Gly His Lys Lys Ala Val Ser Tyr Met Lys Phe Leu Ser Asn Asn Glu 
15 10 15 

Leu Ala Ser Ala Ser Thr Asp Ser Thr Leu Arg Leu Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: Coronin (p5 5) rl , Fig. 19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Gly His Lys Ser Ala Val Leu Asp lie Ala Phe His Pro Phe Asn Glu 



45 



(2> INFORMATION F^p cro m v ^ 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 



15 



20 



(C) INDIVIDUAL ISOLATE: Coronin (p55) rll, Fig. 19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

Gly His Lys Arg Lys Val Gly Thr He Ser Phe Gly Pro Val Ala Asp 
15 10 is 

Asn Val Ala Val Thr Ser Ser Gly Asp Phe Leu Val Lys Thr Trp Asp 
20 25 30 



25 (2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
3 0 (D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 



35 



^UENC?. DESCRIPTION; SEQ ID NG:I^: 
Gly His Ser Asp Met lie Thr Ser Cyg Glu Trr> Ann ot„ 

,.1 c 
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Gin He Val Thr Thr Cys Lys Asp Lys Lys Ala Arg Val Phe Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



<ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 

Cvi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CORO PROTEIN rl, Fig. 18 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Arg His Val Phe Ala Ala Gin Pro Lys Lys Glu Glu Cys Tyr Gin Asn 
1 5 10 15 

25 

Leu Lys Thr Lys Ser Ala Val Trp Asp Ser Asn Tyr Val Ala Ala Asn 
20 25 30 

Thr Arg Tyr He Trp Asp 

30 35 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 



(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Gly His Lys Ser Ala Val Leu Asp lie Ala Phe His Pro Phe Asn Glu 
5 15 10 15 

Asn Leu Val Gly Ser Val Ser Glu Asp Cys Asn lie Cys lie Trp Gly 

20 25 30 



10 



(2) INFORMATION FOR SEQ ID NO: 115: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

20 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
25 (C) INDIVIDUAL ISOLATE: CORO PROTEIN rill, Fig. 18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

3 0 Gly His Lys Arg Lys Val Gly Thr He Ser Phe Gly Pro Val Ala Asp 

15 10 15 

Asn Val Ala Val Thr Ser Ser Gly Asp Phe Leu Val Lys Thr Trp Asp 
20 25 30 

35 

(2) INFORMATION FOR SEQ ID NO: 116: 



(ii) MOLECULE TYPE: peptide 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CORO PROTEIN rIV, Fig. 18 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

Gly His Ser Asp Met lie Thr Ser Cys Glu His Asn Gly Ser Gin lie 
10 1 5 10 15 

Val Thr Thr Cys Lys Asp Lys Lys Ala Arg Val Phe Asp 
20 25 

15 (2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

25 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CSTF 5 0kDa rl , Fig. 2 0 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

Asp His Val Asp Glu Val Thr Cys Leu Ala Phe His Pro Thr Glu Gin 
35 1 5 10 15 

lie Leu Ala Ser Gly Ser Arg Asp Tyr Thr Leu Lys Leu Phe Asp 

20 2^ ^ 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

5 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CSTF 50kDa rll, Fig. 2 0 



10 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Asp His Val Asp Glu Val Thr Cys Leu Ala Phe His Pro Thr Glu Gin 
15 10 15 

lie Leu Ala Ser Gly Ser Arg Asp Tyr Thr Leu Lys Leu Phe Asp 
20 25 30 



2 0 (2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
2 5 ( D ) TOPOLOGY : • unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



30 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CSTF 50kDa rill, Fig. 20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



^eu Ser Ser Gly Lys Asp Ser Val A^a Lys Leu Trp Glu 
20 25 30 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI- SENSE : NO 

ivi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CSTF SOkDa rIV, Fig. 20 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Val His Arg Thr Gin Ala Val Phe Asn His Thr Glu Asp Tyr Val Leu 
15 10 15 

20 

Leu Pro Asp Glu Arg Thr lie Ser Leu Cys Cys Trp Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO: 121: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

30 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: CSTF SOkDa rV, Fig. 2 0 



Gly His Asn Asn He Val Arg Cys He Val His Ser Pro Thr Asn Pro 
15 10 IS 
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20 25 



PCT/US95/01210 



30 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



f T 1 1 1 UVn/Mmrnm-r - < - 



(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE : 

(O INDIVIDUAL ISOLATE : G- BETA DROSOPH rl, Pig. 23 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

Oly His Leu Ala L ys He Tyr Ala Met His Trp 01y Asn Asp fier Arg 
5 10 15 

Asn Leu val Ser Ala Ser Gin Asp Gly Lys Leu lie Val Trp Asp 

20 j - 

^ b 30 

{2} INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(iii) HYPOTHETICAL: NO 
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Gly His Gly Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gin 
15 10 15 

He Val Thr Ser Ser Gly Asp Met Ser Cys Gly Leu Trp Asp 
5 20 25 30 

(2) INFORMATION FOR SEQ ID NO:124: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE : peptide 

15 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

2 0 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G- BETA DROSOPH rill, Fig. 23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124: 

25 

Gly His Thr Gly Asp Val Met Ala Leu Ser Leu Ala Pro Gin Cys Lys 

15 10 15 

Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ala Lys Leu Trp Asp 

30 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 12 5: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(:v) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12 5: 

Gly His Glu Ser Asp He Asn Ala Val Thr Phe Phe Pro Asn Gly Gin 
15 10 15 

5 

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp 
20 25 30 



10 



15 



35 



(2) INFORMATION FOR SEQ ID NO:126: 

(i) SEQUENCE CHARACTERISTICS: 

{A} LENGTH: 34 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

20 (iv) ANTI -SENSE: NO 

<vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G- BETA DROSOPH rV, Fig. 23 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

Ser His Asp Asn He He Cys Gly He Thr Ser Val Ala Phe Ser Lys 
15 10 15 

3 0 Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Asn Val 

20 25 30 

Trp Asp 



(2) INFORMATION FOR SEQ ID NO: 12 7: 
(i) SEQUENCE CHARACTERISTICS: 



(ii) MOLECULE TYPE : peptide 
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(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G- BETA DROSOPH rVI , Fig. 23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Glu Asn Gly Met 
10 1 5 10 15 

Ala Val Ala Thr Gly Sex Tip Asp Ser Phe Leu Arg Val Trp Asn 
20 25 30 

15 (2) INFORMATION FOR SEQ ID NO: 12 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
2 0 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 

(iv) ANTI -SENSE: NO ■ 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rl, Fig. 24 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12 8: 

Gly His Asn Gly Trp Val Thr Gin lie Ala Thr Thr Pro Gin Phe Pro 
15 10 15 

Asp Met lie Leu Ser Ala Ser Arg Asp Lys Thr He He Met Trp Lys 

20 25 30 



25 



30 



35 



(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE: peptide 

Ciii) HYPOTHETICAL: NO 

5 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rll, Fig. 24 



10 



15 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9: 

Gly His Ser His Phe Val Ser Asp Val Val He Ser Ser Asp Gly Gin 
15 10 15 

Phe Ala Leu Ser Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

3 0 (iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rill, Fig. 24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

Gly His Thr Lya Asp Val Leu Ser Val Ala Phe Ser Ser Asp Asn Arg 



(2) INFORMATION FOR SEQ ID NO: 131: 

45 
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(A) LENGTH: 3 3 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unJcnown 

5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE : NO 

10 

(vi) ORIGINAL SOURCE: 

{C} INDIVIDUAL ISOLATE: Ci-BETA HUMAN rIV, Fig. 24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

15 

Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser Ser 
1 5 10 15 

Asn Pro lie He Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val Trp 
20 20 25 30 

Asn 



25 (2) INFORMATION FOR SEQ ID NO:132: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
3 0 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

35 

(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 



Gly His Thr Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser 
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Leu Cys Ala Ser Gly Gly Lys Asp Gly Gin Ala Met Leu Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 13 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 
(BJ TYPE : amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

15 (iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-BETA HUMAN rVI, Fig. 24 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

Lys His Leu Tyr Thr Leu Asp Gly Gly Asp lie lie Asn Ala Leu Cys 
15 10 15 

Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro Ser lie 

20 25 30 



Lys lie Trp Asp 

30 35 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 

Ala Glu Pro Pro Gin Cys Thr Ser Leu Ala Trp Ser Ala Asp Gly Gin 

5 15 10 15 

Thr Leu Phe Ala Gly Tyr Thr Asp Asn Leu Val Arg Val Trp Gin 

20 25 30 

10 (2) INFORMATION FOR SEQ ID NO: 135: 

{ i ) S EOUENCE CHARACTER ICTIC3 : 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

20 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rl, Fig. 21 

25 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 5: 

Gly His Leu Ala Lys lie Tyr Ala Met His Trp Gly Thr Asp Ser Arg 

30 1 5 10 15 

Leu Leu Val Ser Ala Ser Gin Asp Gly Lys Leu lie lie Trp Asp 
20 25 30 

35 (2) INFORMATION FOR SEQ ID NO:136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE- amip^ q^-M 



(iii) HYPOTHETICAL: NO 

45 
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<vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE : G-Beta 1 bovine rll, Fig. 21 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gin 
15 10 15 

10 lie Val Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 137: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

{ D ) TOPOLOGY : unknown 

2 0 <ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



25 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rill, Fig. 21 



30 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Gly His Thr Gly Asp Val Met Ser Leu Ser Leu Ala Pro Asp Thr Arg 
15 10 15 

3 5 Leu Phe Val Ser Gly Ala Cys Asp Ala Ser Ala Lys Leu Trp Asp 

20 25 30 



(2) INFORMATION FOR SEQ ID NO: 13 8: 



(D) TOPOLOGY: unknown 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rIV, Fig. 21 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 8: 

10 

Gly His Glu Ser Asp lie Asn Ala lie Cys Phe Phe Pro Asn Gly Asn 
15 10 15 

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp 
15 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 13 9: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

25 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

3 0 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rV , Fig. 21 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 

Ser His Asp Asn lie lie' Cys Gly lie Thr Ser Val Ser Phe Ser Lys 

15 10 15 

Ser Glv Ara T ; *=v \ T.^i 1 T ^ ^ -> r "-- 1 



45 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta 1 bovine rVI, Fig. 21 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 0: 

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Asp Asp Gly Met 
15 10 15 

20 

Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys lie Trp Asn 
20 25 30 

{2) INFORMATION FOR SEQ ID NO: 141: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

30 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
3 5 (iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE : 

(C) INDIVIDUAL ISOLATE: G-Beta -bovine ( 2 ) rl, Via 77 



Gly His Leu Ala Lys He Tyr Ala Met His Trp Gly Thr Asp Ser Arg 
1 5 10 
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20 25 30 

(2) INFORMATION FOR SEQ ID NO: 142: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 



15 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta-bovine (2 ) rll, Fig. 22 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:142: 

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gin 
15 10 15 

2 5 lie lie Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 143: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

3 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTT-SENSF- NO 



4^ 
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Gly His Ser Gly Asp Val Met Ser Leu Ser Leu Ala Pro Asp Gly Arg 
15 10 15 

Thr Phe Val Ser Gly Ala Cys Asp Ala Ser lie Lys Leu Trp Asp 

5 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

( u ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

15 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

20 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta-bovine ( 2 ) rIV, Fig. 22 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

25 

Gly His Glu Ser Asp lie Asn Ala Val Ala Phe Phe Pro Asn Gly Tyr 

15 10 15 

Ala Phe Thr Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp 

30 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(iv) ANTI-SENSE: NO 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 * 

Ser His Asp Asn lie lie Cys Gly lie Thr Ser Val Ala Phe Ser Arg 

5 15 10 15 

Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Asn lie 
20 25 30 

10 Trp Asp 

(2) INFORMATION FOR SEQ ID NO: 14 6: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



25 



30 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta-bovine { 2 ) rVI , Fig. 22 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:146: 

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Asp Asp Gly Met 
15 10 15 



Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys lie Trp Asn 
35 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 147: 



^ , TOPOLOGY : unknot: 
(ii) MOLECULE TYPE: peptide 
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(iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G - Be ta2 (Human) rl, Fig. 25 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 7: 

Gly His Leu Ala Lys He Tyr Ala Met His Trp Gly Thr Asp Ser Arg 
10 l s 10 15 

t.<hi Leu Val Ser Ala Ser Gin Asp Gly Lys Leu lie lie Trp Asp 
20 25 30 

15 (2) INFORMATION FOR SEQ ID NO: 14 8: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
2 0 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
25 : 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta2 (Human) rll, Fig. 25 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Asn Gin 
35 1 5 10 15 

He He Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp 

20 25 30 



(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE: peptide 

{iii) HYPOTHETICAL: NO 

5 (iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta2 (Human) rill, Fig. 25 



10 



15 



20 



25 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 9: 

Gly His Ser Gly Asp Val Met Ser Leu Ser Leu Ala Pro Asp Gly Arg 

15 10 15 

Thr Phe Val Ser Gly Ala Cys Asp Ala Ser lie Lys Leu Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

3 0 (iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G - Be ta2 (Human) rIV, Fig. 25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 

Gly His Glu Ser Asp lie Asn Ala Val Ala Phe Phe Pro Asn Glv T\ 



(2) INFORMATION FOR SEQ ID NO: 151: 
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(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Bcta2 (Human) iV, Fig. 2b 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

Ser His Asp Asn lie lie Cys Gly lie Thr Ser Val Ala Phe Ser Arg 
15 10 15 

20 Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Asn lie 

20 25 30 

Trp Asp 



25 



(2) INFORMATION FOR SEQ ID NO: 152: 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 31 amino acids 
3 0 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

3 5 (iii) HYPOTHETICAL: NO 

(iv) ANT I - SENSE : NO 

(vi) ORIGINAL SOURCE ■ 



SEQUENCE DESCRIPTION: SEQ ID NO: 15; 



4 F n 1 . v Hi r- ftsr 
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10 



20 



25 



30 



35 
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Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Lys lie Trp Asn 
20 25 30 

{2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

15 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rl, Fig. 26 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:153: 

Gly His Leu Ala Lys lie Tyr Ala Met His Trp Gly Tyr Asp Ser Arg 
15 10 15 

Leu Leu Val Ser Ala Ser Gin Asp Gly Lys Leu lie lie Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



KiGINKi, SOURCE. 

(C) INDIVIDUAL ISOLATE: G-Beta4 imouse ; rll, Fig. 26 
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Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe Leu Asp Asp Gly Gin 
15 10 15 

lie lie Thr Ser Ser Gly Asp Thr Thr Cys Ala Leu Trp Asp 

5 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS : 
10 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY- unknown 

(ii) MOLECULE TYPE: peptide 

15 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

2 0 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rill, Fig. 26 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

25 

Gly His Ser Gly Asp Val Met Ser Leu Ser Leu Ser Pro Asp Leu Lys 

15 10 15 

Thr Phe Val Ser Gly Ala Cys Asp Ala Ser Ser Lys Leu Trp Asp 

30 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECTTLF! TYP^ • -pr^^ 



iv> ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:156: 

Gly His He Ser Asp He Asn Ala Val Ser Phe Phe Pro Ser Gly Tyr 
5 15 10 15 

Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp 
20 25 30 

10 (2) INFORMATION FOR SEQ ID NO: 157: 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

20 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

<C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rV, Fig. 26 

25 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15 7: 

Ser His Asp Asn He He Cys Gly He Thr Ser Val Ala Phe Ser Lys 

30 1 5 10 15 

Ser Gly Arg Leu Leu Leu Ala Gly Tyr Asp Asp Phe Asn Cys Ser Val 
20 25 30 

35 Trp Asp 



(2) INFORMATION FOR SEQ ID NO: 158: 



(D) TOPOLOGY : unknown 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: G-Beta4 (mouse) rVI , Fig. 26 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

10 

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Asp Asp Gly Met 
15 10 15 

Ala Val Ala Thr Gly Ser Trp Asp Ser Phe Leu Arg lie Trp Asn 
15 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

2 0 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

25 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

3 0 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GROUCHO PROT . DRSPH rl , Fig. 27 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

Thr Ser Ala Ala Pro Ala Cys Tyr Ala Leu Ala Ser Pro Asp Ser Lys 
15 10 15 



(i) SEQUENCE CHARACTERISTICS: 
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(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

5 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
10 (C) INDIVIDUAL ISOLATE: GROUCHO PR07 . DRSPH rll, Fig. 2 7 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

15 Gly His Thr Asp Gly Ala Ser Cys He Asp He Ser Pro Asp Gly Ser 

15 10 15 

Arg Leu Trp Thr Gly Gly Leu Asp Asn Thr Val Arg Ser Trp Asp 
20 25 30 



20 



(2) INFORMATION FOR SEQ ID NO: 161: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

3 0 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (C) INDIVIDUAL ISOLATE: GTP binding prt squid rl , Fig. 2 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 



Asn Leu Val Ser Ala Ser Gin Asp Gly Lys Leu He Val Trp Asp 
20 25 30 



4c; 



WO 95/21252 PCT/US95/01210 

- 230 - 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI -SENSE: NO 

(vij ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rll, Fig. 2 8 

15 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Gly His Thr Gly Tyr Leu Ser Cys Cys Arg Phe lie Asp Asp Asn Gin 
15 10 15 

20 

lie Val Thr Ser Ser Gly Asp Met Thr Cys Ala Leu Trp Asn 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GTP bindina prt squid rTTT, Fia 79 



25 



30 



Gly His Thr Gly Asp Val Met Ser Leu Ser Leu Ala Pro Asp Met Arg 
15 10 IS 



WO 95/21252 



PCT/US95/01210 



15 



- 231 - 

20 25 30 



(2) INFORMATION FOR SEQ ID NO:164: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 (ii) MOLECULE TYPE: peptide 

( iii) HYPOTHKT I CAL : NO 

(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rIV, Fig. 28 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:164: 

Gly His Glu Ser Asp lie Asn Ala lie Thr Tyr Phe Pro Asn Gly Phe 
15 10 15 

25 Ala Phe Ala Thr Gly Ser Asp Asp Ala Thr Cys Arg Leu Phe Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 165: 

30 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

3 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



. ^\ . r.. 
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Ser His Asp Asn He He Cys Gly He Thr Ser Val Ala Phe Ser Lys 
15 10 15 

Ser Gly Arg Leu Leu Leu Gly Gly Tyr Asp Asp Phe Asn Cys Asn Val 
20 25 30 

Trp Asp 



10 (2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
Ciii) HYPOTHETICAL: NO 
(iv) ANT I- SENSE: NO 



20 



25 



30 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: GTP binding prt squid rVI, Fig. 28 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:166: 

Gly His Asp Asn Arg Val Ser Cys Leu Gly Val Thr Glu Asp Gly Met 

15 10 15 

Ala Val Ala Thr Gly Ser Trp Asp 
20 

(2) INFORMATION FOR SEQ ID NO: 167: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 



(iii) HYPOTHETICAL : NO 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 

Gly His Asn Gly Trp Val Thr Gin He Ala Thr Thr Pro Gin Phe Pro 
5 15 10 15 

Asp Met He Leu Ser Ala Ser Arg Asp Lys Thr He He Met Trp Lys 
20 25 30 



10 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 173: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: peptide 

20 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE : . 
25 (C) INDIVIDUAL ISOLATE: HUMAN 12.3 rll, Fig. 30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



Gly His Ser His Phe Val Ser Asp Val Val He Ser Ser Asp Gly Gin 

15 10 15 

Phe Ala Leu Ser Gly Ser Trp Asp Gly Thr Leu Arg Leu Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 



(ii) MOLECULE TYPE : peptide 
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<iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rill, Fig. 30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 

Gly His Thr Lys Asp Val Leu Ser Val Ala Phe Ser Ser Asp Asn Arg 
10 1 5 10 15 

Gin He Val Ser Gly Ser Arg Asp Lys Thr He Lys Leu Trp Asn 
20 25 30 

15 (2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 
2 0 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 



25 



30 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rIV, Fig. 30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

Ser His Ser Glu Trp Val Ser Cys Val Arg Phe Ser Pro Asn Ser Ser 
35 1 5 10 15 

Asn Pro He He Val Ser Cys Gly Trp Asp Lys Leu Val Lys Val Trp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 176 
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(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

5 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE : NO 

10 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rV, Fig. 30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:176: 

15 

Gly His Thr Gly Tyr Leu Asn Thr Val Thr Val Ser Pro Asp Gly Ser 
15 10 15 

Leu Cys Ala Ser Gly Gly Lys Asp Gly Gin Ala Met Leu Trp Asp 
20 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



30 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



3 5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rVI , Fig. 30 



Phe Ser Pro Asn Arg Tyr Trp Leu Cys Ala Ala Thr Gly Pro S^r 
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10 



20 



25 
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Lys lie Trp Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

15 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: HUMAN 12.3 rVII, Fig. 3 0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 

Val lie Ser Thr Ser Ser Lys Ala Glu Pro Pro Gin Cys Thr Ser Leu 
15 10 15 

Ala Trp Ser Ala Asp Gly Gin Thr Leu Phe Ala Gly Tyr Thr Asp Asn 

20 25 30 



Leu Val Arg Val Trp Gin 

30 35 

(2) INFORMATION FOR SEQ ID NO: 17 9: 

(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:179: 

Gly His Gin Lys Glu Gly Tyr Gly Leu Ser Trp Asn Ser Asn Leu Ser 
5 15 10 15 

Gly His Leu Leu Ser Ala Ser Asp Asp His Thr Val Cys Leu Trp Asp 
20 25 30 



10 



(2) INFORMATION FOR SEQ ID NO: 180: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
15 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

20 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
25 (C) INDIVIDUAL ISOLATE: IEF- 7442 -human rll, Fig. 31 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

30 Gly His Ser Ala Val Val Glu Asp Val Ala Trp His Leu Leu His Glu 

15 10 15 

Ser Leu Phe Gly Ser Val Ala Asp Asp Gin Lys Leu Met lie Trp Asp 
20 25 30 

35 

(2) INFORMATION FOR SEQ ID NO: 181: 



i'GPOLOG: : 
(ii) MOLECULE TYPE: 



unxnowu 
peptide 
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(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: IEF- 7442 -human rill, Fig. 31 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

Ala His Thr Ala Glu Val Asn Cys Leu Ser Phe Asn Pro Tyr Ser Glu 
10 1 5 10 15 

Fhe lie Leu Ala Thr Gly Ser Ala Asp Lys Thr Val Ala Leu Trp Asp 
20 25 30 



15 



(2) INFORMATION FOR SEQ ID NO: 182: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

25 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (C) INDIVIDUAL ISOLATE: IEF- 744 2 -human rIV, Fig. 31 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

35 Val His Trp Ser Pro His Asn Glu Thr He Leu Ala Ser Ser Gly Thr 

1 5 io 15 

Asp Arg Arg Leu Asn Val Trp Asp 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

5 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: IEF- 7442 -human rV, Fig. 31 



10 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 

Gly His Thr Ala Lys lie Ser Asp Phe Ser Trp Asn Pro Asn Glu Pro 
15 10 15 

Trp Val lie Cys Ser Val Ser Glu Asp Asn lie Met Gin He Trp Gin 
20 25 30 



2 0 (2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino, acid 
25 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



30 



(iv) ANT I- SENSE: NO 



(vi) ORIGINAL SOURCE; 

(C) INDIVIDUAL ISOLATE: Insulin-like GF binding 
35 protein complex rl. Fig. 32 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO r 1 84 ■ 



Ser Arg Leu Glu Asp Gly Leu Phe Glu Gly Leu Gly Ser Leu Trp Asp 
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(2) INFORMATION FOR SEQ ID NO: 18 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: Insulin- like growth factor bind. 

pro. complex- rat rl, Fig. 33 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

20 Thr His Thr Pro Ser Leu Ala Ser Leu Ser Leu Ser Ser Asn Leu Leu 

15 10 15 

Gly Arg Leu Glu Glu Gly Leu Phe Gin Gly Leu Ser His Leu Trp Asp 
20 25 30 

25 

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

3 0 (A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



35 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



pro. complex- rat rl 



Fig. U 
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Asn His Leu Glu Thr Leu Ala Glu Gly Leu Phe Ser Ser Leu Gly Arg 
15 10 15 



Val Arg Tyr Leu Ser Leu Arg Asn Asn Ser Leu Gin Thr Phe Ser Pro 
5 20 25 30 

Gin Pro Gly Leu Glu Arg Leu Trp Leu Asp Ala Asn Pro Trp Asp 
35 40 45 



10 (2) INFORMATION FOR SEQ ID NO: 187: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL; NO 

20 

(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: LIS1 (human) rl, Fig. 34 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



Gly His Arg Ser Pro Val Thr Arg Val He Phe His Pro Val Phe Ser 

30 1 5 10 15 

Val Met Val Ser Ala Ser Glu Asp Ala Thr He Lys Val Trp Asp 

20 25 30 

3 5 (2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

>A,LECULfc. 1YPL. peptide 
(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE : 

(C) INDIVIDUAL ISOLATE: LIS1 (human) rll, Fig. 34 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:188: 

Gly His Thr Asp Ser Val Gin Asp lie Ser Phe Asp His Ser Gly Lys 
15 10 15 

10 Leu Leu Ala Ser Cys Ser Ala Asp Met Thr lie Lys Leu Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 189: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

20 (ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 



25 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: LIS1 (human) rill, Fig. 34 



3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

Gly His Asp His Asn Val Ser Ser Val Ala lie Met Pro Asn Gly Asp 

15 10 15 

35 His lie Val Ser Ala Ser Arg Asp Lys Thr lie Lys Met Trp Glu 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 190: 



. B / TYPE: ammo acid 
(D) TOPOLOGY: unknown 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: LIS1 (human) rIV, Fig. 34 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 

10 

Gly His Arg Glu Trp Val Arg Met Val Arg Pro Asm nin Asp Gly Thr 
15 10 15 

Leu lie Ala Ser Cys Ser Asn Asp Gin Thr Val Arg Val Trp Val 

15 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

2 0 (A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

25 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 

3 0 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: LIS1 (human) rV, Fig. 34 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

Gly Ser Glu Thr Lys Lys Ser Gly Lys Pro Gly Pro Phe Leu Leu Ser 
15 10 15 



INFORMATION FOR SEQ ID NO: 192 
fi) SEQUENCE CHARACTERISTICS 



WO 95/21252 PCT/US95/01210 

- 247 - 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

5 (iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 
10 (C) INDIVIDUAL ISOLATE: LIS1 (human) rVI, Fig. 34 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:192: 

15 Gly His Asp Asn Trp Val Arg Gly Val Leu Phe His Ser Gly Gly Lys 

15 10 15 

Phe He Leu Ser Cys Ala Asp Asp Lys Thr Leu Arg Val Trp Asp 
20 25 30 



20 



(2) INFORMATION FOR SEQ ID NO:193: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
25 (B) TYPE: amino acid 

<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

3 0 (iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (C) INDIVIDUAL ISOLATE: LIS1 (human) rVII, Fig. 34 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 



Tyr Val Val Thr Gly Ser Val Asp Gin Thr Val Lys Val Trp Glu 

20 25 "*0 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

5 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

10 (iv) ANTI-SENSE: NO 

(vij ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MD6 rl, Fig. 3 5 



15 



20 



25 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

Gly His Ser Ala Arg Val Tyr Ala Leu Tyr Tyr Lys Asp Gly Leu Leu 
1 5 10 is 

Cys Thr Gly Ser Asp Asp Leu Ser Ala Lys Leu Trp Asp 
20 25 

(2) INFORMATION FOR SEQ ID NO:195: 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 27 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MD6 rIT, Fiq ^ 



Thr His Thr Cys Ala Ala Val Lys Phe Asp Glu Gin Lys Leu Val Thr 

1 5 : n 
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20 25 

(2) INFORMATION FOR SEQ ID NO:196: 

5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



15 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MD6 rill, Fig. 35 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:196: 

Gly His Thr Gly Ala Val Phe Ser Val Asp Tyr Ser Asp Glu Leu Asp 
15 10 15 

25 He Leu Val Ser Gly Ser Ala Asp Phe Ala Val Lys Val Trp Ala 

20 25 30 

(2) INFORMATION FOR SEQ ID NO:197: 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

35 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



INDIVIDUAL ISOLATE ; MDb rIV, Fig. 3b 
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Gly His Thr Glu Trp Val Thr Lys Val Val Leu Gin Lys Cys Lys Val 
15 10 15 

Lys Ser Leu Leu His Ser Pro Gly Asp Tyr lie Leu Leu Ser Ala Asp 
5 20 25 30 

Lys Tyr Glu lie Lys lie Trp Pro 
35 40 

10 (2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

20 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MSL1 rl , Fig. 36 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

Lys His Asp Gly Gly Val Asn Ser Cys Arg Phe Asn Tyr Lys Asn Ser 

30 1 5 10 15 

Leu lie Leu Ala Ser Ala Asp Ser Asn Gly Arg Leu Asn Leu Trp Asp 

20 25 30 



35 



(2) INFORMATION FOR SEQ ID NO: 199: 
(i) SEQUENCE CHARACTERISTICS: 



(ii) MOLECULE TYPE: peptide 
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(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MSL1 rll, Fig. 36 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199: 

Glu His Gly Thr Ser Val Ser Thr Leu Glu Trp Ser Pro Asn Phe Asp 
10 1 5 10 15 

Thr Val Leu Ala Thr Ala Gly Gin Glu Asp Gly Leu Val Lys Leu Trp 
20 25 30 

15 Asp 

(2) INFORMATION FOR SEQ ID NO: 2 00: 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

2 5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



30 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MSL1 rill, Fig. 36 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 



Gly His Met Leu Gly Val Asn Asp lie Ser Trp Asp Ala His Asp Pro 
1 5 10 ^ 



INFORMATION F^p SFQ IP NO ■ ?. n ^ 
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(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MUS MUSCULUS PROTEIN rl , Fig. 3 7 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 01: 

Gly His Ser Gly Cys Val Asn Thr Val His Phe Asn Gin His Gly Thr 
15 10 15 



10 



20 



Leu Leu Ala Ser Gly Ser Asp Asp Leu Lys Val lie Val Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 202: 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

3 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: MUS MUSCULUS PROTEIN rll, Fig. 3 7 



■3iy His Ue Phe lie Trp Giu Lys Ser Ser Cys Gin He Val Gin Phe 
1 5 10 15 
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Tyr Leu Pro Val Leu Ala Ser Ser Gly Leu Asp His Glu Val Lys He 
35 40 45 

Trp Ser 

50 



(2) INFORMATION FOR SEQ ID NO: 203: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

15 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



20 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: ORF RBI rl , Fig. 38 



2 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 03: 

Lys His Asp Gly Gly Val Asn Ser Cys Arg Phe Asn Tyr Lys Asn Ser 
15 10 15 



30 



35 



Leu He Leu Ala Ser Ala Asp Ser Asn Gly Arg Leu Asn Leu Trp Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 2 04: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: ORF RBI rll, Fig. 38 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:204: 

Glu His Gly Thr Ser Val Ser Thr Leu Glu Trp Ser Pro Asn Phe Asp 
15 10 15 

10 Thr Val Leu Ala Thr Ala Gly Gin Glu Asp Gly Leu Val Lys Leu Trp 

20 25 30 



Asp 



15 



(2) INFORMATION FOR SEQ ID NO:205: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : peptide 

25 (iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (C) INDIVIDUAL ISOLATE: ORF RBI rill, Fig. 38 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

35 Gly His Met Leu Gly Val Asn Asp He Ser Trp Asp Ala His Asp Pro 

15 10 15 

Trr> T'pn Mpt- Cvn JO ^ 7v«^ i\nr TT ' ^ " T ' 1 ^ v 



(2) INFORMATION FOR SEQ ID NO: 206: 



c FQtTFN^F CHAFA-^FP T r T"T rr 
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(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

5 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
10 (C) INDIVIDUAL ISOLATE: Periodic Trp prt rl, Fig. 39 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:206: 

15 Gly His lie Thr Thr His His Thr Asp Ala Val Leu Ser Met Ala His 

15 10 15 

Asn Lys Tyr Phe Arg Ser Val Leu Ala Ser Thr Ser Ala Asp His Thr 

20 25 30 

20 

Val Lys Leu Trp Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 2 07: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

30 

(ii) MOLECULE TYPE : peptide 
(iii) HYPOTHETICAL: NO 
35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE- PonnH- 



lie His Ser Asn Lys Asn Val Ser Ser Ser Glu Trp His Met Leu Asn 

1 ^ 
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20 25 30 

Asp Val Arg lie Ser Asp Glu Ser Gin Met Ser Lys Tyr Trp Ser 
35 40 45 



5 



(2) INFORMATION FOR SEQ ID NO: 208: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
10 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : peptide 

15 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
20 (C) INDIVIDUAL ISOLATE: PLAP rl , Fig. 40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 

2 5 Gly His Lys Asp Thr Val Cys Ser Leu Ser Ser Gly Lys Phe Gly Thr 

15 10 15 

Leu Leu Ser Gly Ser Trp Asp Thr Thr Ala Lys Val Trp Leu 
20 25 30 

30 

(2) INFORMATION FOR SEQ ID NO: 2 09: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



ANTI- SENSE: NO 
(vi) ORIGINAL SOURCE: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 9: 

Gly His Thr Ala Ala Val Trp Ala Val Lys lie Leu Pro Glu Gin Gly 
15 10 15 

5 

Leu Met Leu Thr Gly Ser Ala Asp Lys Thr lie Lys Leu Trp Lys 
20 25 30 



10 



15 



25 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

20 (iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: PLAP rill, Fig. 4 0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

Gly His Glu Asp Cys Val Arg Gly Leu Ala lie Leu Ser Glu Thr Glu 
15 10 15 

Phe Leu Ser Cys Ala Asn Asp Ala Ser lie Arg Arg Trp Gin 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE : amino acid 



(iii) HYPOTHETICAL: NO 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: PLAP rIV, Fig. 4 0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 11: 

Gly His Thr Asn Tyr He Tyr Ser He Ser Val Phe Pro Asn Ser Lys 
1 5 10 15 

Asp Phe Val Thr Thr Ala Glu Asp Arg Ser Leu Arg He Trp Lys 

20 2R 3n 



(2) INFORMATION FOR SEQ ID NO:212: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

2 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN 
HUMAN, rl, Fig. 41 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:212: 

Gly His Gin Lys Glu Gly Tyr Gly Leu Ser Trp Asn Pro Asn Leu Ser 
1 5 10 15 

Gly His Leu Leu Ser Ala Ser Asp Asp His Thr He Cys Leu Trp Asp 
20 25 30 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN 
HUMAN rll, Fig. 41 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 

Gly His Thr Ala Val Val Glu Asp Val Ser Trp His Leu Leu His Glu 
15 1 5 10 15 

Ser Leu Phe Gly Ser Val Ala Asp Asp Gin Lys Leu Met He Trp Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 2 14: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 
25 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

3 0 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN 

HUMAN rill, Fig. 41 



5 10 is 

Asn Pro Tyr Ser Glu Phe II ^ Ten A1 => T u >- r, <— " ■ - 
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5 



10 



25 
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Val Ala Leu Trp Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

Ciii) HYPOTHETICAL: NO 

15 (iv) ANTI -SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: RETINOBLASTOMA BINDING PROTEIN 
HUMAN rIV, Fig. 41 

20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:215: 

Ser His Lys Asp Glu He Phe Gin Val Gin Trp Ser Pro His Asn Glu 

15 10 is 

Thr He Leu Ala Ser Ser Gly Thr Asp Arg Arg Leu Asn Val Trp Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 216: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



Uv, ANTI - SENSE : NO 
'vil ORIGINAL SOURCE 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

Gly His Thr Ala Lys He Ser Asp Phe Ser Trp Asn Pro Asn Glu Pro 

5 15 10 15 

Trp Val He Cys Ser Val Ser Glu Asp Asn He Met Gin Val Trp Gin 
20 25 30 



10 



30 



35 



(2) INFORMATION FOR SEQ ID NO: 217: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
15 (B) TYPE: amino acid 

<D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

20 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 
25 (C) INDIVIDUAL ISOLATE: S253 PROTEIN rl, Fig. 42 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 



Glu His Ala Leu Asp He Leu Asp Ala Asn Trp Ser Lys Asn Gly Phe 
15 10 is 

Leu He Thr Ala Ser Met Asp Lys Thr Ala Lys Leu Trp His 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 2 18: 
(i) SEQUENCE CHARACTERISTICS: 



(ii) MOLECULE TYPE : peptide 
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(iv) ANT I- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: S253 PROTEIN rll, Fig. 42 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:218: 

Val His Pro Asp Phe Val Thr Ser Ala He Phe Phe Pro Asn Asp Asp 
10 1 5 io 15 

Arg Phe He He Thr Gly Cys Leu Asp His Arg Cys Arg Leu Trp Ser 
20 25 30 



15 



(2) INFORMATION FOR SEQ ID NO: 219: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
2 0 (b) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

25 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

30 (C) INDIVIDUAL ISOLATE: SOF1 rl , Fig. 43 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 9: 

35 Gly His Arg Asp Gly Val Tyr Ala He Ala Lys Asn Tyr Gly Ser Leu 

15 io is 

Asn Lys Leu Ala Thr Gly Ser Ala Asp Glv Val Tie T.vp Tvr Tr^ 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
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(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: SOF1 rll, Fig. 43 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

Gly Leu Cys Val Thr Gin Pro Arg Phe His Asp Lys Lys Pro Asp Leu 
15 10 15 

Lys Ser Gin Asn Phe Met Leu Ser Cys Ser Asp Asp Lys Thr Val Lys 
20 25 30 

Leu Trp Ser 
20 35 

(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 3 5 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

30 

(iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 

35 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: SOF1 rill, Fig. 43 



15 



Ser His Arg Glu Asn Ser Thr Phe Ala Th> 
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5 



10 
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Leu Trp Asp 
35 

(2) INFORMATION FOR SEQ ID NO:222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: SOF1 rIV, Fig. 43 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:222: 

Gly His Ser Arg Glu lie Tyr His Thr Lys Arg Met Gin His Val Phe 
1 5 , 10 15 

25 

Val Lys Tyr Ser Met Asp Ser Lys Tyr lie He Ser Gly Ser Asp Asp 
20 25 30 

Gly Asn Val Arg Leu Trp Arg 

30 35 

(2) INFORMATION FOR SEQ ID NO:223: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 



•: he - : 



(iv) ANTI-SENSE: NO 



WO 95/21252 PCI7US95/01210 

- 265 - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

Gly His Asn Asn Lys lie Ser Asp Phe Arg Trp Ser Arg Asp Ser Lys 

5 15 10 15 

Arg lie Leu Ser Ala Ser Gin Asp Gly Phe Met Leu lie Trp Asp 
20 25 30 

10 (2) INFORMATION FOR SEQ ID NO:224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

20 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL |SOLATE: STE4- YEAST rll, Fig. 44 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

Gly His Thr Cys Tyr lie Ser Asp lie Glu Phe Thr Asp Asn Ala His 

30 1 5 10 15 

lie Leu Thr Ala Ser Gly Asp Met Thr Cys Ala Leu Trp Asp 
20 25 30 

3 5 (2) INFORMATION FOR SEQ II? NO: 225: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH • 37 amino ^i^c 



25 



iii] MOLECULE TYPE: peptide 
' i i. ^ HYPOTHFTTCA T • 
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(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: STE4- YEAST rill, Fig. 44 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 5: 

Asp His Leu Gly Asp Val Leu Ala Leu Ala lie Pro Glu Glu Pro Asn 
15 10 15 

Leu Glu Asn Ser Ser Asn Thr Phe Ala Ser Cys Gly Ser Asp Gly Tyr 

20 25 3C 

Thr Tyr lie Trp Asp 
35 

(2) INFORMATION FOR SEQ ID NO: 22 6: 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

2 5 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
30 (C) INDIVIDUAL ISOLATE: STE4 - YEAST rIV, Fig. 44 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

35 Leu Asp Asn Gin Gly Val Val Ser Leu Asp Phe Ser Ala Ser Gly Arg 

15 10 15 



Leu Met Tvr Ser <~vp Tb>- 



(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amirm a-i-^- 
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fii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

5 Civ) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(O INDIVIDUAL ISOLATE: STS4 -YEAST rV, Fig. 44 



10 



15 



20 



25 



35 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 227: 

Gly His Gly Gly Arg Val Thr Gly Val Arg Ser Ser Pro Asp Gly Leu 



15 



Ala Val Cys Thr Gly Ser Trp Asp Ser Thr Met Lys He Trp Ser 



20 25 



30 



(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

30 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rl, Fig. 45 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:228: 

Gly His Thr Gly Pro Val Tyr Arg Cys Ala Phe Ala Pro 01, * f 



" 3 3 0 

(2) INFORMATION FOR SEQ ID NO: 22 9: 
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(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANT I- SENSE: NO 



10 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rll, Fig. 45 



15 (jci) SEQUENCE DESCRIPTION : SEQ ID NO:229: 

Gly His Val Tyr Pro Val Trp Asp Val Arg Phe Ala Pro His Gly Tyr 

15 10 15 

20 Tyr Phe Val Ser Cys Ser Tyr Asp Lys Thr Ala Arg Leu Trp Ala 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 23 0: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

3 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rill, Fig. 4 5 



Gly His Leu Ser Asp Val Asp Cys Val Gin Phe His Pro Asn Ser Asn 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: TRNSCRPTION FCTR TIIF rIV, Fig. 45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

20 Gly His Lys Gly Ser Val Ser Ser Leu Ala Phe Ser Ala Cys Gly Arg 

15 10 15 

Tyr Leu Ala Ser Gly Ser Val Asp His Asn lie lie lie Trp Asp 
20 , 25 30 



25 



(2) INFORMATION FOR SEQ ID NO: 232: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

30 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

35 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 
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Val Leu Ala Ala Ala Gly Leu Asp Asn Asn Leu Thr Leu Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

15 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1 rl, Fig. 46 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

Ser Ser Asp Leu Tyr lie Arg Ser Val Cys Phe Ser Pro Asp Gly Lys 
1 5 ; 10 15 

Phe Leu Ala Thr Gly Ala Glu Asp Arg Leu lie Arg lie Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



ORIGINAL SOURCE; 

(C) INDIVIDUAL ISOLATE: TUP1 rll, Fig. 46 
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Gly His Glu Gin Asp lie Tyr Ser Leu Asp Tyr Phe Pro Ser Gly Asp 
15 10 15 

Lys Leu Val Ser Gly Ser Gly Asp Arg Thr Val Arg He Trp Asp 

5 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



15 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : NO 



20 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1 rill, Fig. 46 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

25 

He Glu Asp Gly Val Thr Thr Val Ala Val Ser Pro Gly Asp Gly Lys 

15 10 15 

Tyr He Ala Ala Gly Ser Leu Asp Arg Ala Val Arg Val Trp Asp 

30 20 25 30 



(2) INFORMATION FOR SEQ ID NO: 236: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 



..i i ., HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 
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(C) INDIVIDUAL ISOLATE: TUP1 rIV, Fig. 46 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 : 

5 

Gly His Lys Asp Ser Val Tyr Ser Val Val Phe Thr Arg Asp Gly Gin 
15 10 15 

Ser Val Val Ser Gly Ser Leu Asp Arg Ser Val Lys Leu Trp Asn 
10 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 23 7: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
{D} TOPOLOGY: unknown 



20 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



25 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1 rV, Fig. 46 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

30 

Gly His Lys Asp Phe Val Leu Ser Val Ala Thr Thr Gin Asn Asp Glu 
15 10 15 

Tyr He Leu Ser Gly Ser Lys Asp Arg Gly Val Leu Phe Trp Asp 

35 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 23 8: 



TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rl, Fig. 4 7 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 8: 

Asp Phe Ser Asp Asp Cys Arg lie Ala Ala Ala Gly Phe Gin Asp Ser 
10 1 5 10 15 

Tyr lie Lys lie Trp Ser 
20 

15 (2) INFORMATION FOR SEQ ID NO: 23 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
2 0 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

25 

(iv) ANTI- SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rll, Fig. 47 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 

Gly His Ser Gly Thr Val Tyr Ser Thr Ser Phe Ser Pro Asp Asn Lys 

35 1 5 10 15 

Tyr Leu Leu Ser Gly Ser Glu Asp Lys Thr Val Arg Leu Trp Ser 

20 25 30 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
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(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

5 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUPl HOMOLOG rill, Fig. 47 



10 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 

Gly His Asn His Pro Val Trp Asp Val Ser Phe Ser Pro Leu Gly His 
15 io 15 

Tyr Phe Ala Thr Ala Ser His Asp Gin Thr Ala Arg Leu Trp Ser 
20 25 30 



20 (2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
2 5 (D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI - SENSE : NO 



30 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUPl HOMOLOG rIV, Fig. 47 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 



:y: Vai Phe Thr G.y Ser Ser Asp Lys Thr Cys Arg Met Trp Asp 
20 25 30 



WO 95/21252 



PCT/US95/01210 



- 275 - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

5 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
10 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rV, Fig. 47 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:242: 

Gly His Thr Ala Pro Val lie Ser lie Ala Val Cys Pro Asp Gly Arg 
15 10 15 

20 

Trp Leu Ser Thr Gly Ser Glu Asp Gly lie lie Asn Val Trp Asp 
20 25 30 

(2) INFORMATION FOR SEQ ID ^0:243: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

30 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 
35 (iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: TUP1 HOMOLOG rVI , Fig. 47 



Gly His Gly Lys Asn Ala lie Tyr Ser Leu Ser Tyr Ser Lys Glu Gly 

1 5 10 1 c 
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20 



25 



30 



(2) INFORMATION FOR SEQ ID NO: 244: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

10 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
15 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE : 

(C) INDIVIDUAL ISOLATE: YCU7 rl, Fig. 48 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 244: 

Gly His Phe Asp Ser Thr Asn Ser Leu Ala Tyr Ser Pro Asp Gly Ser 
1 5 10 15 

25 

Arg Val Val Thr Ala Ser Glu Asp Gly Lys He Lys Val Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 245: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

35 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



ORIGINAL SOURCE: 
(C) INDIVIDUAL ISOLATE: YCU7 rll, Fig. 48 
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Glu His Thr Ser Ser Val Thr Ala Val Gin Phe Ala Lys Arg Gly Gin 
15 10 15 

Val Met Phe Ser Ser Ser Leu Asp Gly Thr Val Arg Ala Trp Asp 

5 20 25 30 



(2) INFORMATION FOR SEQ ID NO: 246: 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

15 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

<iv) ANT I- SENSE: NO 



20 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: YCU7 rill, Fig. 48 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

Arg lie Gin Phe Asn Cys Leu Ala Val Asp Pro Ser Gly Glu Val Val 
15 10 15 

30 Cys Ala Gly Ser Leu Asp Asn Phe Asp He His Val Trp Ser 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 24 7: 

3 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acida 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



iiUJ HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(C) INDIVIDUAL ISOLATE : YCU7 rIV, Fig. 4 8 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO:247: 

5 

Gly His Glu Gly Pro Val Ser Cys Leu Ser Phe Ser Gin Glu Asn Ser 
15 10 15 

Val Leu Ala Ser Ala Ser Trp Asp Lys Thr He Arg He Trp Ser 
10 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

20 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE : NO 

2 5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rl, Fig. 49 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:248: 

30 

Gly His Gly Ser Thr He Leu Cys Ser Ala Phe Ala Pro His Thr Ser 
15 10 15 

Ser Arg Met Val Thr Gly Ala Gly Asp Asn Thr Ala Arg He Trp Asp 
35 20 25 30 



INFORMATION FOR SEQ ID NO: 24 9' 



(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rll, Fig. 49 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

10 

Glv His Tyr Asn Trp Val Leu Cys Val Ser Trp Ser Pro Asp Gly Glu 
1 5 10 15 

Val lie Ala Thr Gly Ser Met Asp Asn Thr He Arg Leu Trp Asp 

15 20 25 30 

(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 38 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



25 



(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

3 0 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE : YCW2 PROTEIN rill, Fig. 49 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

35 Gly His Ser Lys Trp He Thr Ser Leu Ser Trp Glu Pro He His Leu 

15 10 15 

Val Lys Pro Gly Ser Lys Pro Arg Leu Ala Ser Ser Ser Lys Asp Glv 



35 



(7^ INFORMATION FOP. SFQ TD NO: 2 51 
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(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; unknown 

5 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

10 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rIV, Fig. 49 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

Gly His Thr Asn Ser Val Ser Cys Val Lys Trp Gly Gly Gin Gly Leu 

15 10 15 

2 0 Leu Tyr Ser Gly Ser His Asp Arg Thr Val Arg Val Trp Asp 

20 25 30 

(2) INFORMATION FOR SEQ ID NO: 252: 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

3 0 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



35 



(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rV, Fig. 4 9 



Lys lie Cys Lys Lys Asn Gly Asn Ser Giu Giu Met Met Val Thr Ala 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 2 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

15 (C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rVI , Fig. 49 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 

20 Asn His Val Ala Phe Ser Pro Asp Gly Arg Tyr lie Val Ser Ala Ser 

15 10 15 

Phe Asp Asn Ser lie Lys Leu Trp Asp 
20 25 



25 



(2) INFORMATION FOR SEQ ID NO: 254: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
3 0 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
3 5 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 54: 
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Leu Leu Val Ser Cys Ser Lys Asp Thr Thr Leu Lys Val Trp Asp 

20 25 30 



5 



10 



(2) INFORMATION FOR SEQ ID NO: 2 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

15 (iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: YCW2 PROTEIN rVIII, Fig. 4 9 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:255: 

Ser Val Asp Leu Pro Gly He Lys Thr Lys Leu Tyr Val Asp Trp Ser 
1 5 io 15 

25 Val As P Gly Lys Arg Val Cys Ser Gly Gly Lys Asp Lys Met Val Arg 

20 * 25 30 



Leu Trp Thr 
35 
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(2) INFORMATION FOR SEQ ID NO:256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: YKL525 rl, Fig. 50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 56: 

20 Leu His Leu Tyr Ala Pro Val Phe Tyr Ser Asp Val Phe Arg Val Phe 

15 10 15 



Met 

25 



Glu His Ala Leu Asp lie Leu Asp Ala Asn Trp Ser 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 2 57: 



PCT/US95/01210 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE : NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: YKL525 rll, Fig. 50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:257: 

2 0 Val His Pro Asp Phe Val Thr Ser Ala He Phe Phe Pro Asn Asp Asp 

15 10 15 



25 



Arg Phe He He Thr Gly Cys Leu Asp His Arg Cys Arg Leu Trp Ser 
20 , 25 30 
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(2) INFORMATION FOR SEQ ID NO: 25 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: yrb 1410 yeast rl, Fig. 51 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 

2 0 Gly His Asn His Pro Val Trp Asp Val Ser Phe Ser Pro Leu Gly His 

15 10 15 



Tyr Phe Ala Thr Ala Ser His Asp Gin Thr Ala Arg Leu Trp Ser 
20 25 30 



25 
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(2) INFORMATION FOR SEQ ID NO: 25 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: yrb 1410 yeast rll, Fig. 51 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

Gly His Leu Asn Asp Val Asp Cys Val Ser Phe His Pro Asn Gly Cys 
15 10 15 



25 



Tyr Val Phe Thr Gly Ser Ser Asp Lys Thr Cys Arg Met Trp Asp 
20 25 30 
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(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: yrb 1410 yeast rill, Fig. 51 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

20 Gly His Thr Ala Pro Val lie Ser He Ala Val Cys Pro Asp Gly Arg 

15 10 15 

Trp Leu Ser Thr Gly Ser Glu Asp Gly He He Asn Val Trp Asp 
20 25 30 



25 
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(2) INFORMATION FOR SEQ ID NO: 261: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

5 (B) TYPE: amino acid 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (C) INDIVIDUAL ISOLATE: yrb 1410 yeast rIV, Fig. 51 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO:261: 

20 Gly His Gly Lys Asn Ala lie Tyr Ser Leu Ser Tyr Ser Lys Glu Gly 

15 10 15 



25 



Asn Val Leu lie Ser Gly Gly Ala Asp His Thr Val Arg Val Trp Asp 
20 25 30 
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10 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : peptide 

Ciii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: WD40 Consensus Sequence 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

Gly His Ser Ala Ala Leu Ala Ala Leu Ala Leu Ser Pro Asp Ala Ala 
15 10 15 

Ala Ala Ala Leu Ala Ser Gly Ala Arg Asp Ala Thr Leu Arg Leu Trp 



20 



25 



30 



Asp 



Leu 
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(2) INFORMATION FOR SEQ ID NO: 2 63: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE: peptide 



(llil TTYPnTUFTTPaT.. vcc 



(iv) ANT I- SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: WRTAA peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:263: 

20 Trp Arg Thr Ala Ala 

1 5 



! 
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(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



10 



(ii) MOLECULE TYPE : peptide 



(ii-n HYPOTHETICAL; YES 



(iv) ANTI- SENSE: NO 

15 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: WRTAV peptide 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 



20 



Trp Arg Thr Ala Val 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 26 5: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
5 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

10 

(iii) HYPOTHETICAL: YES 



(iv) ANTI- SENSE: NO 



!5 (vi) ORIGINAL SOURCE: 

(C) INDIVIDUAL ISOLATE: WRTA peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265: 



20 



Trp Arg Thr Ala 

1 
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Claims 

1. A polypeptide composition effective to alter the 
activity of a first protein, wherein the first protein interacts with a 
5 second protein, and the second protein contains at least one WD-4 0 
region, 

said polypeptide having between 4 and 50 amino acids whose 
sequence is the same as a sequence of the same length in the WD-4 0 region 
of the second protein. 

10 

o "^he composition of claim 1 , v/hsrsin Cuid polypeptide 

inhibits interactions between the first protein and the second protein; 
and/or wherein said polypeptide is an agonist of the activity of the 
first protein; and/or wherein said polypeptide is an antagonist of the 
15 activity of the first protein. 

3. The composition of claim 1 or 2, wherein said WD-40 
region has an amino acid sequence derived from the group consisting of 
SEQ ID NO:76-261. 

20 

4. The composition of claim 3, wherein said WD-40 region 
has an amino acid sequence selected from the group consisting of SEQ ID 
NO:76-261. 

25 5. The polypeptide composition of claim 1 wherein said 

polypeptide is coupled to a solid support. 

6. A method to bind selectively said first protein which 
method comprises contacting a sample putatively containing said first 

3 0 protein with the polypeptide composition of claim 5; and 

removing any unbound components of the sample from said 

composition. 

7. A method to assess the interaction of a first protein 
35 with a polypeptide having a sequence the same as a sequence of the same 

length contained in a WD-40 region of a second protein, which method 
comprises 

rontactinq a sample r-on^a i ni nn pta-id f •? r^*- 



of the first protein with said polypeptide composition. 

o r. Tfif;^H tn n n n ° r r t~ Vi n 'b ' 1 "i *~ v ^ f ~\ ^ i " ^ i i i ' n ^ ~ ^ 1 ' *" 

;. i . :■ *., .1 r . v/i r : i. i s ...... ypept ioe ■:*ompcs i . w;;; rw.'i.s c>.n . ; : i 
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30 



35 
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wherein the polypeptide of said composition has between 4 and 50 amino 
acids whose sequence is the same as a sequence of the same length in a 
WD-40 region of a second protein which interacts with said first protein, 
in the presence and absence of said candidate compound; and 

measuring the binding of said polypeptide in the presence 
and in the absence of said candidate, 

wherein decreased binding of the polypeptide in the presence 
as opposed to the absence of said candidate indicates that said candidate 
binds to said first protein. 



9. A method to alter the activity of a first protein th*t 
interacts with a second protein, where the second protein contains at 
least one WD-40 region, said method comprising 

selecting a polypeptide having between 4 and 50 amino acids 
15 whose sequence is the same as a sequence of the same length in the WD-4 0 
region in the second protein, and 

contacting said polypeptide with said first protein under 
conditions which allow the formation of a complex between the polypeptide 
and the first protein, where said interaction is effective to alter the 
20 activity of the first protein. 

10. The method of claim 9, wherein said contacting is 
effective to inhibit the interaction between said first and second 
proteins; and/or wherein said contacting is effective to stimulate the 
activity of said first protein; and/or wherein said contacting is 
effective to inhibit the activity of said first protein. 

11. The. method of any of claims 5-10, wherein said 
polypeptide is derived from the group consisting of SEQ ID NO:76-261. 

12. The method of claim 11, wherein said polypeptide is 
selected from the group consisting of SEQ ID NO: 76-261. 

13 . A composition of DNA molecules which consists of DNA 
molecules having a nucleotide sequence encoding the polypeptide of any 
of claims 1-4. 

14. A DNA molecule which comprises an expression svst-pm fnr 



encoding nucleotide sequence. 



15 



Recombinant host cell^ modifir.^ i- - 
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16 . A method to produce a polypeptide having between 4 and 
50 amino acids whose sequence is the same as the sequence of the same 
length in a WD-40 region of a second protein which interacts with a first 
protein/ which method comprises culturing the cells of claim 15 under 
5 conditions wherein said nucleotide sequence is expressed to produce said 
polypeptide; and 

optionally recovering said polypeptide from the culture. 

17. A polypeptide composition effective to alter the 
10 activity of a protein kinase C, where the protein kinase C interacts with 

a second protein, and the second protein contains at least one WD-4 0 
region, 

said polypeptide having between 4 and 50 amino whose 
sequence is the same as a sequence of the same length in the WD-40 region 
15 of the second protein. 

18. The composition of claim 17, wherein said second 
protein is a receptor for activated protein kinase C. 

20 19. The composition of claim 18 , where said second protein 

has the sequence represented by SEQ ID NO: 27. 

20. The composition of claim 17, wherein said polypeptide 
is an agonist of the activity of protein kinase C; and/or wherein said 

25 polypeptide is an antagonist 6f the activity of protein kinase C; and/or 
wherein said polypeptide inhibits interactions between protein kinase C 
and the second protein. 

21. The composition of claim 20 wherein said polypeptide 
3 0 has the sequence represented by SEQ ID NO: 7, SEQ ID NO: 4 or SEQ ID NO : 2 . 

22. The composition of claim 17, wherein said WD-40 region 
has an amino acid sequence derived from the group consisting of SEQ ID 
NO:69-75 . 

35 

23. The composition of claim 22, wherein said WD-40 region 
has an amino acid sequence selected from the group consisting of SEQ ID 
N0:69-75 . 



25 . A method to bind selectively protein kinase C which 
method comprises contacting a sample putatively containing protein kinase 
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removing any unbound components of the sample from said 

composition. 

26. A method to assess the interaction of protein kinase 
C with a polypeptide having a sequence the same as a sequence of the same 
length contained in the WD-40 region of a second protein, which method 
comprises 

contacting a sample containing said protein kinase C 
with a polypeptide composition wherein the polypeptide has between 4 and 
50 amino acids whose sequence is the same as the sequence of the same 

1 F>nrrt- H -in i*rn_ a c\ >-<=~-; ~ -c *- t — a .. . - 

— ocuuiiu jjiuitiin , ana observing any 

interaction of the protein kinase C with said polypeptide composition. 

27. A method to assess the ability of a candidate compound 
15 to bind protein kinase C which method comprises contacting said protein 

kinase C with a polypeptide composition which binds said protein kinase 
C, wherein the polypeptide of said composition has between 4 and 50 amino 
acids whose sequence is the same as a sequence of the same length in a 
WD-4 0 region of a second protein which interacts with said protein kinase 

2 0 c, in the presence and absence of said candidate compound; and 

measuring the binding of said polypeptide in the presence 
and in the absence of said candidate, 

wherein decreased binding of the polypeptide in the presence 
as opposed to the absence of said candidate indicates that said candidate 

25 binds to said protein kinase C. 

i 

28. A method to alter the activity of protein kinase C 
that interacts with a second protein, where the second protein contains 
at least one WD-40 region, comprising 

selecting a polypeptide having between 4 and 50 amino acids 
whose sequence is the same as a sequence of the same length in the WD-40 
region in the second protein, and 

contacting said polypeptide with said protein kinase C under 
conditions which allow the formation of a complex between the polypeptide 
and the protein kinase C, where said interaction alters the activity of 
said protein kinase C. 



30 



35 



contacting is effective to inhibit the activity of said protein kinase 
C. 
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30. The method of claim 29, wherein said polypeptide has 
an amino acid sequence represented by SEQ ID NO: 2, SEQ ID NO: 4 or SEQ ID 
NO: 7. 

5 31. The method of claim 28, wherein said polypeptide is 

derived from the group consisting of SEQ ID NO: 69-75. 

32. The method of claim 31, wherein said polypeptide is 
selected from the group consisting of SEQ ID NO:69-75. 

10 

33 A composition of DNA molecules which consists of DMA 
molecules having a nucleotide sequence of encoding the polypeptide of any 
of claims 17-23. 

15 34 . A DNA molecule which comprises an expression system for 

the production of the polypeptide of any of claims 17-23 which expression 
system comprises a nucleotide sequence encoding said polypeptide operably 
linked to control sequences capable of effecting the expression of said 
encoding nucleotide sequence. 

20 

35. Recombinant host cells modified to contain the 
expression system of claim 34 . 

36. A method to produce a polypeptide having between 4 and 
25 50 amino acids whose sequence is the same as the sequence of the same 

length in a WD-40 region of a second protein which interacts with protein 

kinase C, which method comprises culturing the cells of claim 3 5 under 
conditions wherein said nucleotide sequence is expressed to produce said 
polypept ide ; and 

30 optionally recovering said polypeptide from the culture. 
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Fig, 11 

Human 56 kDa protein (PWP homolog) 
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CDC4 / CDC20 protein 
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Coronin (p55) 
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HUMAN 12.3 
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IEF -7442 - human 
i maskemfedt veervineey kiwkkntpfl ydlvmthalq wpsltvqwlp 
51 evtkpegkdy alhwlvlgth tsdeqnhlw arvhipndda qfdashcdsd 
101 kgefggfgsv tgkieceiki nhegevnrar ympqnphiia tktpssdvlv 
151 fdytkhpakp dpsgecnpdl 



171 rlrflliqk^gyglswnsnlsghn^asddhtvcliKjiinagpkegkivdaka 
221iftflhsajvedvawhUheslfgjvQddqklmiwdtrsnt 
tskpshl vdahtaf vncl sf npysef i \at gsadktvalHd.1 ml kl kl h 



261 

311 tfeshkdeifqvhws||hneti 
361 qppeUfihqgh.tal[:isdfsvvnpnepwvicjvsednimqiwqmQeniynd 



lassgtdrrlnvjKdlskigeeqsaedaed 



411 eesdvttsel egqgs 



SUBSTITUTE SHEET (RULE 26) 



WO 95/21252 



PCT/US95/01210 



31 / 53 



insulin-like growth factor binding protein complex 
1 malrkgglal allllswval gprslegadp gtpgeaegpa cpaacvcsyd 
51 ddadelsvfc ssrnltrlpd gvpggtqalw Idgnnlssvp paafqnlssl 
101 gflnlqggql gslepqallg lenlchlhle rnqlrslalg 



141 tfahtpalaslglsnnrlsrledglfeglgslMtdlnlgwn slavlpdaaf 



rglgslrelv 



201 lagnrlaylq palfsglael reldlsrnal raikanvfvq Iprlqklyld 

251 rnliaavapg aflglkalrw Idlshnrvag lledtfpgll glrvlrlshn 

301 aiaslrprtf kdlhfleelq Ighnrirqla ersfeglgql evltldhnql 

351 qevkagaflg Itnvavmnls gnclrnlpeq vfrglgklhs Ihlegsclgr 

401 irphtftgls glrrlflkdn glvgieeqsl wglaelleld Itsnqlthlp 

451 hrlfqglgkl eylllsrnrl aelpadalgp Iqrafwldvs hnrlealpns 

^qnpsavpi vauLLeqaci _ uppuvrvrv . asppevv ... . . bci. 
601 hfapc 

Fig 32 



WO 95/21252 



PCT/US95/01210 



32 / 53 

insulin like growth factor binding protein complex - rat 
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